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ABSTRACT 

We conduct a comprehensive analysis of the relationship between central galaxies and their host dark matter 
halos, as characterized by the stellar mass - halo mass (SM-HM) relation, with rigorous consideration of 
uncertainties. Our analysis focuses on results from the abundance matching technique, which assumes that 
every dark matter halo or subhalo above a specific mass threshold hosts one galaxy. We provide a robust 
estimate of the SM-HM relation for < z < 1 and discuss the quantitative effects of uncertainties in observed 
galaxy stellar mass functions (GSMFs) (including stellar mass estimates and counting uncertainties), halo mass 
functions (including cosmology and uncertainties from substructure), and the abundance matching technique 
used to link galaxies to halos (including scatter in this connection). Our analysis results in a robust estimate 
of the SM-HM relation and its evolution from z=0 to z=4. The shape and evolution are well constrained for 
Z < 1. The largest uncertainties at these redshifts are due to stellar mass estimates (0.25 dex uncertainty in 
normalization); however, failure to account for scatter in stellar masses at fixed halo mass can lead to errors 
of similar magnitude in the SM-HM relation for central galaxies in massive halos. We also investigate the 
SM-HM relation to z = 4, although the shape of the relation at higher redshifts remains fairly unconstrained 
when uncertainties are taken into account. We find that the integrated star formation at a given halo mass peaks 
at 10-20% of available baryons for all redshifts from to 4. This peak occurs at a halo mass of 7 x 10 n M© at 
z = and this mass increases by a factor of 5 to z = 4. At lower and higher masses, star formation is substantially 
less efficient, with stellar mass scaling as M» ~ Mjp at low masses andM, ~M°' 29 at high masses. The typical 
stellar mass for halos with mass less than 10 12 M© has increased by 0.3-0.45 dex for halos since z ~ 1. These 
results will provide a powerful tool to inform galaxy evolution models. 

Subject headings: dark matter — galaxies: abundances — galaxies: evolution — galaxies: stellar content — 
methods: N-body simulations 



1. INTRODUCTION 

A variety of physical processes are thought to be respon- 
sible for the observed distribution of galaxy properties, and 
distinguishing among them is one of the principal goals of 
modern galaxy formation theory. Among the relevant mech- 
anisms are those responsible for galaxy growth, such as star 
formation and galaxy mergers, as well as those responsible 
for regulating growth, including energetic feedback by super- 
novae, active galactic nuclei, cosmic ray pressure, and long 
gas cooling times. 

A fruitful approach to separating the influence of different 
mechanisms is to constrain the redshift-dependent relation 
between physical characteristics of galaxies, such as stellar 
mass, and the mass of their dark matter halos. This is possible 
because it is expected that many of these physical processes 
depend primarily on the mass of a galaxy's dark matter halo. 
By connecting galaxies to their parent halos, one is able to 
more clearly identify and constrain the physical processes re- 
sponsible for galaxy growth. 

The galaxy stellar mass - halo mass relation has additional 
utility because many properties of both galaxies and halos 
are tightly correlated with halo mass. In addition, the stel- 
lar mass - halo mass relation provides a mechanism for con- 
necting predictions for the halo mass function and the mass- 
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dependent spatial clustering of halos to the abundances and 
clustering of galaxies. If a model for galaxy evolution is 
able to reproduce the intrinsic galaxy mass - halo mass re- 
lation in the correct cosmological model, then such a model 
will match both the observed stellar mass function and the 
stellar mass dependent clustering of galaxies. Simultane- 
ously matching these two observational quantities and their 
evolution has been difficult with either hydrodynamical sim- 
ulations or semi-ana l ytic models o f galaxy formation (e.g., 
IWeinberg et alJl2004lUetaLll2007h . 

There are several ways to constrain the galaxy mass - 
halo mass relation. The first type of approach attempts 
to directly measure the mass of galactic halos. Tech- 
niques include weak lensing (e.g.. iGuzik & Seliakl 120021 : 
iSheldon et"aTll2004t iMandelbaum et al.ll2006l) and the use of 
satellite galaxy or stellar velocities as tracers of the halo po- 
tential well (e.g..lAshman et al.ll993fclZaritskv & Whitell99"4T: 
i Prada et all 120031: Ivan den Bosch et al. 2004; Conro vet alj 
2007). While such methods are a relatively direct probe of 
halo mass, they are limited in dynamic range; current obser- 
vations probe halo masses from roughly 10 12 — 10 14 M Q at the 
present epoch, and a smaller range at higher redshift. A sec- 
ond approach is to identify groups and clusters of galaxies 
either through optical or X-ray selected cluster catalogs, and 
then directly measure the i r galaxy content ( e.g., iLin & Mohrl 
120041: lHansen et al.ll2009t lYang et al.ll2007h . This is limited 
to relatively massive halos (although for optically identified 
groups it could extend to lower masses as new surveys probe 
dimmer galaxies in large enough volumes), and it also re- 
quires accurate knowledge of the mass-observable relation 
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(lYang et al.ll2007tlHansen et alJl2009h . 

An alternative approach is to assume that the properties 
of the halo population are known, for example from cos- 
mological simulations, and then find a functional form re- 
lating galaxies to halos which achieves agreement with a 
variety of observations. This approach is less direct but 
can be applied o ver a much larger dynamic range. Halo 



occupation (e.g., |_B erlind & Weinberg 



dynamic 

.. .. -ifl2002t iBullock et alJ 

2002 : ICoorav & Shethll2002t iTinker et al.ll2005t IZheng et alj 
20071) and conditional lumino sity function modeling (e.g., 
Yang et aD2003t ICoorayll2006l) fall into this category. 

In the past decade, a number of studies have found that 
this latter approach can be greatly simplified using a tech- 
nique called abundance matching. In its most basic form, 
the technique assigns the most massive (or the most lumi- 
nous) galaxies to the most massive halos monotonically. The 
technique thus requires as input only the observed abundance 
of galaxies as a function of mass, namely the galaxy stel- 
lar mass function (alternatively the galaxy luminosity func- 
tion) and the abundance of dark matter halos as a func- 
tion of mass, namely the halo mass function. This tech- 
nique has been shown to accurately reproduce a variety 
of observational results including various measures of the 
redshift- and scal e -dependent spatial cluste r ing of galaxies 
dColfn et all 1 19991: iKravtsov & Klypid[l99l iNevrinck et all 
2004t iKravtsoyet all 12004 iVale & Ostrikerl 12004 120061 
Tasitsiomi et alJ 12004 IConrovetal l 120061: IShankar et alj 
2006t iBerrier et al.ll200a iMarin et al.N2008t iGuo et al.ll2009L 
Moster et al.1 120091) . In the context of this technique, not 
only central halos but also subhalos (halos contained within 
the virial radii of larger halos) are included in the matching 
process, meaning that satellite galaxies can be accounted for 
without any additional parameters. 

Applications of the abundance matching technique have 
typically focused on using the default modeling assumptions 
to derive statistical information about the galaxy - halo con- 
nection. Uncertainties in the derived galaxy mass - halo mass 
relation have received little sy stematic attention i n the context 
of this technique (though see iMoster et al.ll2009l for a recent 
treatment of the attendant uncertainties). An accurate assess- 
ment of the uncertainties is necessary to make strong state- 
ments regarding the underlying physical processes responsi- 
ble for the derived galaxy - halo relation. In the present work 
we undertake an exhaustive exploration of the uncertainties 
relevant in constructing the galaxy stellar mass - halo mass re- 
lation from the abundance matching technique. We consider a 
range of uncertainties related to the observational stellar mass 
function, the theoretical halo mass function, and the underly- 
ing technique of abundance matching. The resulting galaxy 
stellar mass - halo mass relation and associated uncertain- 
ties will provide a benchmark against which galaxy evolution 
models may be fruitfully tested. 

This paper is divided into several sections. In ^2]we detail 
known sources of uncertainty which may affect our results. 
Our methodology for modeling the effects of uncertainties is 
discussed in $3] providing simple conversions where possi- 
ble to allow for different modeling choices. We present our 
resulting estimates of the galaxy stellar mass - halo mass re- 
lation for z < 1 in $4] and describe the contribution of each 
of the uncertainties to the overall error budget. Estimates for 
the evolution of the relation out to z ~ 4, for which the uncer- 
tainties are significantly less well-understood, are presented 
in $5] Finally, we discuss the implications of this work in ^6] 
and summarize our conclusions in $7] 



Stel lar masses throughout are quoted assuming a Chabrier 
d2003l) initial mass function (IMF), the stell ar population 
synthesis models o f iBruzual & Chariot! d2003l) . and the age 
and dust models in Blanton & Roweis (2007). We consider 
multiple cosmologies in this paper, but the main results as- 
sume a WMAP5+SN +BAO concordance ACDM cosmology 
dKomatsu et al.l i2009) with Q M = 0.27, A = 1 -Q M , h = 0.7, 
o-g = 0.8, and n s = 0.96. 

2. UNCERTAINTIES AFFECTING THE STELLAR MASS 
TO HALO MASS RELATION 

Uncertainties in the abundance matching technique for as- 
signing galaxies to dark matter halos can be conceptually sep- 
arated into three classes. The first is uncertainty in the abun- 
dance of galaxies as a function of stellar mass. This class 
includes both uncertainties in counting galaxies due to shot 
noise and sample variance, as well as uncertainties in the stel- 
lar mass estimates themselves. The second class concerns the 
dark matter halos, and includes uncertainties in cosmological 
parameters, the impact of baryon condensation, and substruc- 
ture. Finally, there are uncertainties in the process of match- 
ing galaxies to halos arising primarily from the intrinsic scat- 
ter between galaxy stellar mass and halo mass. Each of these 
sources of uncertainty are described in detail below. The de- 
tailed modeling of these uncertainties is described in $3] 

2.1. Uncertainties in the Stellar Mass Function 

Galaxy stellar masses are not measured directly, but are in- 
stead inferred from photometry and/or spectra. In particular, 
as the observed stellar light is a function of many physical 
processes (e.g., stellar evolution, star formation history and 
metal-enrichment history, and wavelength-dependent dust at- 
tenuation), stellar masses are estimated via complicated mod- 
els to find the best fit to galaxy observations in a very large 
parameter space. Assumptions and simplifications in these 
models, along with the fact that the best fit may be only one 
of a number of likely possibilities, mean that there can be sub- 
stantial uncertainties in calculated stellar masses. 

Different types of observations can yield different uncer- 
tainties in these calculations. Spectroscopic surveys generally 
recover more spectral features per galaxy and more accurate 
redshifts than photometric surveys. However, spectroscopy 
requires substantially more telescope time than photometry. 
Therefore, spectroscopic samples tend to be limited both in 
area and depth, which translates into limitations in both vol- 
ume and the minimum stellar mass probed. An additional 
problem for spectros copic surveys s uch as the Sloan Digital 
Sky Survey (SDSS. lYork et al.ll2000h is that the spectra probe 
only the central regions of galaxies (the SDSS spectra gather 
on average 1 /3 of the total flux from galaxies at z = 0.1). For 
galaxies containing both a bulge and a disk, or galaxies with 
radial gradients, the spectra wil l therefore not prov ide a fair 
sample of the entire galaxy (e.g.. lKewley et alJ2005l) : further- 
more, this bias will be a function of redshift. For these rea- 
sons, especially for robust comparisons of evolution, we con- 
fine this paper to photometric-based stellar masses; however, 
samples with spectroscopic redshifts are used where avail- 
able. 

2.1.1. Principal Uncertainties in Stellar Mass Functions 

In this paper, we analyze seven main sources of uncertain- 
ties in stellar mass functions applicable to photometric sur- 
veys, listed below in rough order of importance. 
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1. Choice of stellar initial mass function (IMF): The lu- 
minosity of stars scales as their mass to a large power 
(dlnL/dlnM — 3.5), while the IMF, defined as the num- 
ber of stars formed per unit mass, scales approximately 
as £ oc M" 2 3 (Sal peterlll955b . Thus, the light from a 
galaxy is dominated by the most massive stars, while 
the total stellar mass is dominated by the lowest mass 
stars. This fact, which has been known for decades 
(e.g., iTinslev & Gunn|[l976l) . implies that the assumed 
form of the IMF at M < 0.5M Q will have no effect on 
the integrated light from galaxies, but will have a large 
effect on the total stellar mass. Nonetheless, constraints 
on the total dynamical mass of spheroidal systems pro- 
vide a valuable inde pendent check on the fo rm of the 
IMF at low masses. Cappel lari et al.1 (|2006l) find, for 
example, that the IMF proposed by IChabrierl (2003) 
for the Solar neighborhood is consistent with dynam- 
ical constraints on masses of nearby ellipticals. 

We do not marginalize over IMFs in our error calcula- 
tions for the reason that i t is relatively si mple to convert 
from our choice of IMF (Chabrier 2003) to other IMFs. 
For our purposes the IMF becomes a serious source 
of systematic uncertainty only if the IMF varies with 
galaxy properties or if it evolves. It should be noted, 
however, that the IMF can also introduce more compli- 
cated systematic effects associated with the inferred star 
formation rate, which ma y in turn impact ste llar masses 
in non-trivial ways (e.g. IConroy eUdl20l(l 

2. Choice of Stellar Population Synthesis (SPS) model: 
SPS modeling efforts have grown subs tantially in so- 
phistication in the past decad e (e.g. iLeithereretaF 
1999k IBruzual & Chariot! 120031: iLelorgne et aUfcoTT 



Mara stonl 120051). Yet, significant u ncertainti es re 



main (e.g ICharlot et alJ[l99a |Charloll[T996t IYH I2001 



iLee et al] l2007h IConroy et al.l l2009h . Treatments of 
convection vary, leading to different main sequence 
turn off times for intermediate mass stars. Advanced 
stages of stellar evolution, including blue stragglers, 
thermally-pulsating AGB stars (TP-AGB), and hori- 
zontal branch stars, are poorly understood, both obser- 
vationally and theoretically. The theoretical spectral 
libraries contain known deficiencies, especially for M 
giants, where the effective temperatures are low and 
where hydrodynamic effects become important. Em- 
pirical stellar libraries to some extent circumvent these 
issues, although they are plagued by incomplete cover- 
age in the HR diagram and difficu lties associated with 
deri ving stellar parameters. See IConroy et alJ (2009) 
and lPercival & Salarisl (|2009) for recent reviews. 

Different stellar population synthesis models treat these 
issues differently, which can result in large system- 
atic differences in the derived st e llar m ass. For 
ins tance, the model o f iMarastonl (120051) compared 
to IBruzual & Charlofl d2003l) has systematic differ- 
ences of 0.1 dex in stellar m ass dSalimbeni et al.ll2.Q09t 
iPerez-Gonzalez et al.ll2008l) . Ho weverJ Salimben i et alJ 
(2009) reports that the model in iBruzuall (120071) (with 
a revised treatment of TP-AGB star s) yields systematic 
differe nces in stellar mass relative to Bruzual & CharloJ 
(2003), which ranges from 0.05de x for lO n M galax- 
ies to 0.3dex for 1O 9 5 M galaxies. IConroy et all ( 120091) 
show that uncertainty in the luminosity of the TP-AGB 



phase can shift stellar masses by as much as ±0.2 dex. 

3. Parameterization of star formation histories: In or- 
der to estimate stellar masses, model libraries are con- 
structed with a large range in star formation histories 
(SFHs), dust attenuation (see below), and, often but not 
always, metallicity. The adopted functional form of the 
SFH is another source of systematic uncertainty, as typ- 
ically very simple functional forms are assumed. Sev- 
eral authors have investigated various aspects of this 
problem . When attempting to mod el observed pho- 
tometry, IPerez-G onzalez et alJ (120081) found that a sin- 
gle stellar population model (in particular, star forma- 
tion proportional to e~'l r , which is a commonly-used 
parameterization) systematically underpredicts stellar 
mass by 0.18 dex compared to a double stellar popula- 
tion model (exponentially decaying star formation fol- 
lowed by a later starburst). The particular parameteri- 
zation of SFHs may also lead to systematic d ifferences 
as a function of stellar mass. ILee et al] (120091) analyzed 
a sample of mock Lyman-break galaxies at z ~ 4-5 
and found that simple SFHs produced best-fit stellar 
masses that were under or overestimated by ~ ±50% 
depending on the rest-frame galaxy color. This bias was 
attributed to the chaotic SFHs of the mock galaxies. 

4. Choice of dust attenuation model: Because dust red- 
dens starlight, it is difficult to separate the effects of 
dust from stellar population effects, especially when fit- 
ting optical photometry. The effects of dust are also 
known to change d epending on galaxy inclination (e.g., 
iDriver et~ail 120071) . Hence, the choice of dust attenu- 
ation law has a nontrivial effect on the inferred stel- 
lar population ages and, consequently, star formation 
histories derive d from photometric and even spectro- 
scopic surveys dPanter et alJ l2007[). In terms of the 
effect on stellar masses, IPerez-Gonzalez et all f2 008) 
compared the dust mo dels of Calzetti et al. (2000J) and 
ICharlot & Falll (2000). fi nding a systematic difference 
of O.lOdex. iPanter et aD (120071) found a similar differ- 
ence between Calzetti et al. and models based on ex- 
tinction curves from the Small and Large Magellanic 
Clouds. The effects of varying dust att enuation models 
have a lso b een explored r ecentl y by iMarchesin i et all 
(2009) and iMuzzin et alJ (120091) . with similar results. 
We have used t he stellar populatio n fitting proce- 
dure described in IConroy et alJ (120091) to compare the 
Calzetti et al. dust attenuation law to the dust model 
used in the kcorrect package (Blanton & Roweisl 
2007). We find a median offset of 0.02dex but also a 
systematic trend such that two galaxies whose stellar 
masses are estimated with Calzetti dust attenuation and 
are separated by l.Odex will have kcorrect masses 
separated by (on average) only 0.92dex. 

5. Statistical errors in individual stellar mass estimates: 
Stellar mass estimates for each galaxy are subject to 
statistical errors due to uncertainty in photometry as 
well as uncertainty in the SPS parameters for a given 
set of model assumptions. We treat this here as a ran- 
dom statistical error. While it may seem that random 
scatter in individual stellar masses should on average 
have no systematic effect, it in fact introd uces a sys- 
tematic error analogous to Eddington bias (Eddington 



4 



BEHROOZI, CONROY & WECHSLER 



1940) observed in luminosity functions. As the stellar 
mass function drops off steeply beyond a certain char- 
acteristic stellar mass, there are many more low stellar 
mass galaxies that can be up-scattered than there are 
high stellar mass galaxies that can be down-scattered 
by errors in stellar mass estimates. This asymmetric 
scatter implies that the drop-off in number density at 
high masses becomes shallower in the p resenc e of scat- 
ter. We discuss this effect in detail i n ^3.1.21 (see also 
the Appendix in lCattaneo et al.ll2008h . 

6. Sample variance: Surveys of finite regions of the Uni- 
verse are susceptible to large-scale fluctuations in the 
number density of galaxies. This is no longer a dom- 
inant source of uncertainty for the volumes probed at 
low redshift by the SDSS, but it is an important consid- 
eration for higher-redshift surveys, which cover much 
smaller comoving volumes. Most authors who con- 
sider sample variance atte mpt to minimize it by averag- 
ing over several fields ( e.g., |Perez-Gonzalez et all 2008: 
Marchesi ni et al.l 120091) . Very few surveys at z > at- 
tempt to estimate the magnitude of the error except by 
computing the field-to-field variance, which is often 
an underestimate w hen insufficient volume is probed 
(ICrocce et all 2009). We detail a more accurate method 
based on simulatio ns to m odel the error arising from 
sample variance in ^3 .2.4-1 

7. Redshift errors: Photometric redshift errors blur the 
distinction between GSMFs at different redshifts. 
While a galaxy may be scattered either up or down in 
redshift space, volume-limited survey lightcones will 
contain larger numbers of galaxies at higher redshifts, 
meaning that the GSMF as reported at lower redshifts 
will be artificially inflated. Moreover, as galaxies at ear- 
lier times have lower stellar masses, surveys will tend to 
report artificially larger faint-end slopes in the GSMF. 
However, as these errors are well known, it is easy to 
correct for their effects on the stellar mass function, 
as has been done for the d ata in |P erez-Gonzale z et all 
(2008) (see the appendix of lPerez-G onzalez et al.ll2005l 
for details on this process). 

For completeness, we remark that galaxy-galaxy lensing 
will also result in systematic errors in the GSMF at high red- 
shifts because galaxy magnification will result in higher ob- 
served luminosities. How ever, from ray-trac ing studies of 
the Millennium simulation (Hilbert et al. 2007), the expected 
scatter in galaxy stellar masses from lensing is minimal (e.g., 
0.04 dex at z = 1) compared to the other sources of scatter 
above (e.g., 0.25 dex from different model choices). For that 
reason, we do not model galaxy-galaxy lensing effects in this 
paper. 

2.1.2. Additional Systematics at z> 1 

Recently, it has become clear that current estimates of 
the evolution in the cosmic SFR density are not consistent 
with estim ates of the evo l ution of the stellar mass density 
at z > 1 ([N agamin e et al.l 120061: iH onkins & Beacom 120061 : 
iPerez-Gonzalez et alJ 120081: IWilkins et all l2008al) . The ori- 
gin of this discrepancy is currently a matter of debate. One 
soluti on involves allowing for an evol ving IMF with red- 
shift dDave1l2008t IWilkins etal.1 12008 a). While such a so- 
lution is controversial, a number of independent lines of 



evide nce suggest that the IMF was differe nt at high red- 
shift dLucatello et al.ll2005MTumlinsonll2007allb1:lvan DokkurrJ 
120081) . iReddv & Steidell d2009h offer a more mundane ex- 
planation for the discrepancy. They appeal to luminosity- 
dependent reddening corrections in the ultraviolet luminosity 
functions at high redshift, and demonstrate that the purported 
discrepancy then largely vanishes. 

In contrast to results at z > 1, there does seem to be 
an accord that for z < 1 both the integrated SFR and the 
total stellar mas s are in good a greement if one assumes 
(as we have) a IChabrieri (120031) IM F (see Wilki ns et alJ 
2008b : IFerez-Gonzalez et all 12001 IHonkins & Beacoml 
2006F lNagamine et al.ll2006l:IConrov & Wechsleril2009l) . 

Because of the discrepancy between reported SFRs and 
stellar masses in the literature, it is clear that estimates of un- 
certainties in galaxy stellar mass functions and SFRs at z > 1 
tend to underestimate the true uncertainties; for this reason, 
we separately analyze results for z < 1 in ^4] and z > 1 in ^5] 
of this paper. 

2.2. Uncertainties in the Halo Mass Function 

Dark matter halo properties over the mass range 10 °- 10 15 
Mp) have been ex t ensively analyzed i n simulations (e.g ., 
Uenkins et al.l l200lt IWarren et alj " l2006b iTinker et alJ 2008), 
and the overall c osmology has been constrained by probe s 
such as WMAP (ISpergel et all 120031: Homatsu et al] I2009h . 
As such, uncertainties in the halo mass function have on the 
whole much less impact than uncertainties in the stellar mass 
function. We present our primary results for a fixed cosmol- 
ogy (WMAP5), but we also calculate the impact of uncer- 
tain cosmological parameters on our error bars. We do not 
marginalize over the mass function uncertainties for a given 
cosmology, as the relevant uncertainties are constrained at the 
5% level (when b aryonic effects are neglected, see below; 
ITinker et al] 120081) . Additionally, in Appendix [Al a simple 
method is described to convert our results to a different cos- 
mology using an arbitrary mass function. For completeness, 
we mention the three most significant uncertainties here: 

1 . Cosmological model: The stellar mass - halo mass rela- 
tion has dependence on cosmological parameters due to 
the resulting differences in halo number densities. We 
investigate this both by calculating the relation for two 
specific cosmological modes (WMAP1 and WMAP5 
parameters) and then by calculating the uncertainties in 
the relation over the full range of cosmologies allowed 
by WMAP5 data. We find that in all cases these uncer- 
tainties are small compared t o the u ncertainties inher- 
ent in stellar mass modeling ( 32. 1.1b . although they are 
larger than the statistical errors for typical halo masses 
at low redshift. 

2. Uncertainties in substructure identification: Different 
simulations have different methods of identifying and 
assigning masses to substructure. Our matching meth- 
ods make use only of the subhalo mass at the epoch 
of accretion (M acc ) as this results in a better match to 
clustering and pair -count results dConrov et al] 120061 : 
Berrier et al. 2006), so we are largely immune to the 
problem of different methods for calculating subhalo 
masses. Of greater concern is the ability to reliably fol- 
low subhalos in simulations as they are tidally stripped. 
Two related issues apply here. The first is that it is not 
clear how to account for subhalos which fall below the 
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resolution limit of the simulation. The second is that 
the formation of galaxies will dramatically increase the 
binding energy of the central regions of subhalos, po- 
tentially making them more resilient to tidal disruption. 
Hydrodynamic simulations suggest that this latter effect 
is small except for subhalo s that orbit near the ce nters of 
the most massive clusters (Weinberg et al. 2008). How- 
ever, while these details are important for accurately 
predicting the clustering strength on small scales (< 1 
Mpc), they are not a substantial source of uncertainty 
for the global halo mass — stellar mass relation because 
satellites are always sub-dominant (< 20%) by num- 
ber. We discuss the analytic method we use to model 
t he sat ellite contribution to the halo mass function in 

3. Baryonic physics: Recent work by Stan ek et"ail d2009) 
suggests that gas physics can affect halo masses rela- 
tive to dark matter-only simulations by -16% to +17%, 
leading to number density shifts of up to 30% in the 
halo mass function at 10 14 M Q . Without evidence for a 
clear bias in one direction or the other — the models of 
gas physics still remain too uncertain — we do not apply 
a correction for this effect in our mass functions. Un- 
certainties of this magnitude are larger than the statis- 
tical errors in individual stellar masses at low redshift, 
but are still small in comparison to systematic errors in 
calculating stellar masses. 

For completeness, we note that the effects of sample vari- 
ance on halo mass functions estimated from simulations are 
small. Current simulations readily probe volumes of 1000 
(h~ l Mpc) 3 dTinkeretal.i r2008), and so the effects of sample 
variance on the halo mass function are dwarfed by the effects 
of sample variance on the stellar mass function; we therefore 
do not analyze them separately in this paper. 

We also remark on the issue of mass definitions. Al- 
though abundance matching implies matching the most mas- 
sive galaxies to the most massive halos, there is little consen- 
sus on which halo mass definition to use, with popular choices 
being M V! > (mass within the virial radius), M200 (mass within 
a sphere with mean density 200/9 cl -i t ), and M/ f (mass deter- 
mined by a friends-of-friends particle linking algorithm). We 
choose M V i r for this paper and note that the largest effect of 
choosing another mass algorithm will be a purely definitional 
shift in halo masses. We expect that scatter between any two 
of these mass definitions is degenerate with and smaller than 
the amount of scatter in stel lar m asses at fixed halo mass (the 
latter effect is discussed in j$!2.31 >. 

2.3. Uncertainties in Abundance Matching 

Finally, there are two primary uncertainties concerning the 
abundance matching technique itself: 

1 . Nonzero scatter in assigning galaxies to halos: While 
host halo mass is strongly correlated with stellar mass, 
the correlation is not perfect. At a given halo mass, 
the halo merger history, angular momentum properties, 
and cooling and feedback processes can induce scatter 
between halo mass and galaxy stellar mass. This is ex- 
pected to result in scatter in stellar of ~0. 1-0.2 dex at a 
given halo mass, see §3.3.1 for discussion. The scatter 
between halo mass and stellar mass will have system- 
atic effects on the mean relation for reasons analogous 



to those mentioned for statistical error in stellar mass 
measurements. At the high mass end where both the 
halo and stellar mass functions are exponential, scat- 
ter in stellar mass at fixed halo mass (or vice versa) 
will alter the average relation because there are more 
low mass galaxies that are upscattered than high mass 
galaxies that are downscattered. 

2. Uncertainty in Assigning Galaxies to Satellite Halos: 
It is not clear that the halo mass — stellar mass rela- 
tion should be the same for satellite and central galax- 
ies. Once a halo is accreted onto a larger halo, it starts 
to lose halo mass because of dynamical effects such as 
tidal stripping. While stripping of the halo appears to 
be a relatively dramatic process (e.g., iRravtsov et al.1 
2004), the stripping of the stellar component proba- 
bly does not occur unless the satellite passes very near 
to the central object because the stellar component is 
much more tightly bound than the halo. It is clear from 
the observed color-den sity relation dDresslerl 119801; 
iPostman & Gelled 1 1984t lHansen et al.H2009l) that star 
formation in satellite galaxies must eventually cease 
with respect to galaxies in the field. It is less clear how 
quickly star formation ceases, and whether or not there 
is a burst of star formation upon accretion. All of these 
issues can potentially alter the relation between halo 
and stel lar mass for satelli tes (although the modeling re- 
sults of IWang et al.l l2006 suggest that the halo-satellite 
relation is indistinguishable from the overall galaxy- 
halo relation). 

3. METHODOLOGY 

Our primary goal is to provide a robust estimate of the stel- 
lar mass - halo mass relation over a significant fraction of 
cosmic time via the abundance matching technique. We aim 
to construct this relation by taking into account all of the rel- 
evant sources of uncertainty. This section describes in de- 
tail a number of aspects of our methodology, including our 
approach for i ncorporating uncertainties in the stellar mass 
function f ^3. lb . a summary of the ado pted halo mass func- 
tions and associated uncertainties ( £13. 21, the uncertainties as- 
sociated with abundance matching ( 33. 31 , our choice of func- 
tional form for the stellar mass - halo mass relation, including 
a discus sion of why certain functions should be preferred over 
others ( 33.41 . and the Mark ov Chain Monte Carlo parameter 
estimation technique C ^3.5b . For readers interested in the gen- 
eral outline of our process but not the det ails, we conclude 
with a brief summary of our methodology Q3.6b . 

3.1. Modeling Stellar Mass Function Uncertainties 

As discussed in §|2] there are several classes of uncertainties 
affecting the way the stellar mass function is used in the abun- 
dance matching process. In this section, we discuss system- 
atic shifts in stellar mass estimates and the effects of statistical 
errors on the stellar mass function. 

3.1.1. Modeling Systematic Shifts in Stellar Mass Estimates 

Most studies on the GSMF report Schechter function fits 
as well as individual data points; many also provide statisti- 
cal errors. However, even when systematic errors are reported 
(either in Schechter parameters or at individual data points), 
the systematic error estimates are of limited value unless one 
is also able to model shifts in the GSMF caused by such er- 
rors. 
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Fortunately, based on the discussion in £12.1.11 there seem 
to be two main classes of systematic errors causing shifts in 
the GSMF: 

1 . Over/underestimation of all stellar masses by a constant 
factor /i. This appears to cover the majority of errors, 
including most differences in SPS modeling, dust atten- 
uation assumptions, and stellar population age models. 

2. Over/underestimation of stellar masses by a factor 
which depends linearly on the logarithm of the stel- 
lar mass (i.e., depends on a power of the stellar mass). 
This covers the majority of the remaining discrepancies 
between different SPS models and different stellar age 
models. 

Both forms of error are modeled with the equation 

log H^wJ =A1+Klogl0 l^rJ- (l) 

Without loss of generality, we may take Mp = lO l L3 Mfp (the 
fixed point of the variat ion between the Bruzuaj| [ 20071 and 
Bruzual & CharloJ 120031 models found by ISalimbeni et aD 
2009), allowing the prior on Mq to be absorbed into the prior 
on \i. 

For the prior on [i, we consider four contributing sources 
of uncertainty. We adopt estimates of the uncertainty from 
the SPS model (wO.ldex), the dust model (ssO.ldex), and as- 
sumptions about the star fo rmation history (saQ.2 dex) from 
iPerez-Gonzalez et al.1 (120081) as detailed in 32.1.11 Addition- 
ally, we have the variation in k log IO (Mo) (at most O.ldex, as 
|k| < 0.15 — see below). Assuming that these are statisti- 
cally independent, they combine to give a total uncertainty 
of 0.25dex, which is consistent with th e accepted range for 
system atic uncertainties in stellar ma ss (IPerez-Gonzalez et al.l 
2008t iKannappan & Gawiseril2007t Ivan der Wei et al.l l2006t 
Marchesini et al.l l2009h . For lack of adequate information 
(i.e., different models) to infer a more complicated distribu- 
tion, we assume that /x has a Gaussian prior. As more stud- 
ies of the overall systematic shift /i become available, our as- 
sumptions for the prior on /i and the probability distribution 
will likely need corrections. We remark, however, that our 
results can easily be converted to a different assumption for 
[i, as /j simply imparts a uniform shift in the intrinsic stellar 
masses relative to the observed stella r masses. 

For the prior on k, the result of Salimb eni et al.l {2009) 
would suggest \n\ < 0.15. As mentioned in §2.1.11 we f ound 
that |k | » 08 betw een the iBlanton & Row eis (200 7b and 
ICalzetti et ail (|2000) models for d ust attenuation. iLi & White! 
(2009 ) finds \k\ < .10 between IBlanton & Roweisl (120071) 
and iBell et all d2003l) stellar masses. Without a large num- 
ber of other comparisons, it is difficult to robustly determine 
the prior distribution for k; however, motivated by the results 
just mentioned, we assume that the prior on k is a Gaussian 
of width 0.10 centered at 0.0. 

We remark that some authors have considered much more 
complicate d parameterizatio ns of the systematic error. For 
example, ILi & Whitel (120091) considers a four-parameter hy- 
perbolic tangent fit to differences in the GSMF caused by 
different SPS models, as well as a five-parameter quartic fit. 
However, we do not consider higher-order models for system- 
atic errors for several reasons. First, given that second- and 
higher-order corrections will result only in very small correc- 
tions to the stellar masses in comparison to the zeroth-order 



correction (/i sa 0.25dex), the corrections will not substan- 
tially effect the systematic error bars. Second, we do not know 
of any studies which would allow us to construct priors on the 
higher-order corrections. Finally, with higher-order models, 
there is the serious danger of over-fitting — that is, with very 
loose priors on systematic errors, the best-fit parameters for 
the systematic errors will be influenced by bumps and wig- 
gles in the stellar mass function due to statistical and sample 
variance errors. Hence, the interpretive value of the system- 
atic errors becomes increasingly dubious with each additional 
parameter. 

3.1.2. Modeling Statistical Errors in Individual Stellar Mass 
Measurements 

In addition to the systematic effects discussed in the previ- 
ous section, measurement of stellar masses is subject to statis- 
tical errors. Even for a fixed set of assumptions about the dust 
model, SPS model, and the parameterization of star forma- 
tion histories, stellar masses will carry uncertainties because 
the mapping between observables and stellar masses is not 
one-to-one. This additional source of uncertainty has unique 
effects on the GSMF. Observers will see an GSMF (0 mea s) 
which is the true or "intrinsic" GSMF (<p t me) convolved with 
the probability distribution function of the measurement scat- 
ter. For instance, if the scatter is uniform across stellar masses 
and has the shape of a certain probability distribution P, we 
have: 

/oo 
0true ( 1 v ) P (y - log ! o (M) ) dy, (2) 
oo 

where y is the integration variable, in units of log 1() mass. As 
derived in Appendix |B] the approximate effect of the convo- 
lution is 

810 V <^ue(M) J 2 \ d\ogM J ' 

where a is the standard deviation of P. That is to say, the 
effect of the convolution depends strongly on the logarithmic 
slope of 0ti U e- Where the slope is small (i.e., for low-mass 
galaxies), there is almost no effect. Above 10 11 M Q , where 
the GSMF becomes exponential, there can be a dramatic ef- 
fect, with the result that <p tme is more than an order of mag- 
nitude less than </Was because it becomes far more likely that 
stellar mass calculation errors produce a galaxy of very high 
perceived stellar mass tha n it is for there to be such a galaxy 
in reality (see for example Cattane o et alj|2008r) . 

For the observed z ~ GSMF, we take the probabil- 
ity distribution P to be log-normal with la width 0.07dex 
from the analysis of t he photometry of low - redshift luminous 
red ga laxies (LRGs) (IConrov et al.l 120091) . iKauffmann et alj 
(2003) found similar results regarding the width of P. This 
function only accounts for the statistical uncertainties men- 
tioned above and does not include additional systematic un- 
certainties. In light of Equation [3] we use LRGs to esti- 
mate P because LRGs occupy the high stellar mass regime 
where measurement errors are most likely to affect the shape 
of the observed GSMF. However, the single most important 
attribute of the distribution P is its width; the main results do 
not change substantially if an alternate distribution with non- 
Gaussian tails beyond the 1<t limits of P is used. 

For higher redshifts, we scale the width of the probabil- 
ity distribution to account for the fact tha t mass estimates be- 
come less certain at higher redshift (e.g.. lConrov et al.1l2009t 
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iKaiisawa et ai]|2009h : 

P(Alog w M*,z) = ^rP ( ^rAlog 10 M* ) , (4) 

o(z) \ CT(Z) ) 

where Po is the probability distribution at z = (as discussed 
above), ao is the standard deviation of Pq, and a(z) gives the 
e volution of the standa rd deviation as a function of redshift. 

iConrov et al.l (120091) did not give a functional form for 
<j(z), but they calculate for a handful of massive galaxies that 
a(z = 2) is ft! 0.18dex , as compared to a(z = 0) ~ 0.07dex. 
Kaiisawa et al. (2009) performed a similar calculation (albeit 
with a different SPS model) of the distribution in several red- 
shift bins; their results show gradual evolution for a(z) out to 
z = 3.5 for high stellar mass galaxies consistent with a linear 
fit: 

er(z) = cr + a z z. (5) 

The results of Kaiisawa et al. (2009) suggest that a, =0.03- 
0.06 dex for LRGs. As this is consist ent with the value of oy = 
0.05 dex which would correspond to lConrov et al.l (120091) . we 
adopt the linear scaling of Equation [5] with a Gaussian prior 
of cr, = 0.05 ±0.015 dex. 

Note that the effect of this statistical error on the stellar 
mass function is minimal below 10 11 Mq, and therefore does 
not affect the stellar mass - halo m ass relation for halos below 
~ 10 13 Mq, as discussed in £14.21 While this scatter does have 
an effect on the shape of the stellar mass function for high- 
mass galaxies, the qualitative predictions we make from this 
analysis are generic to all types of random scatter. 

3.2. Halo Mass Functions 

The halo mass function specifies the abundance of halos 
as a function of mass and redshift. A number of analytic 
models and simulation-based fitting functions have been pre- 
sented for com puting mass functions given an input cos- 
mology (e.g.. iPress & Schechteri 11974 iJenkins et all |200TI 
IWarren et al.ll2006t iTinker et alj|2008l) . For mo st of our re- 
sults w e will adopt the universal mass function of Tinker et al. 
(2008), as described below. Analytic mass functions are 
preferable as they 1) allow mass functions to be computed for 
a range of cosmologies and 2) do not suffer significantly from 
sample variance uncertainties, because the analytic relations 
are typically calibrated with very large or multiple A^-body 
simulations. 

For some purposes it will be useful to also consider full 
halo merger trees derived directly from A^-body simulations 
that have sufficient resolution to follow halo substructures. 
The simulations used herein will be described below, in ad- 
dition to our methods for modeling uncertainties in the under- 
lying mass function, including cosmology uncertainties, sam- 
ple variance in the galaxy surveys, and our models for satellite 
treatment. 

3.2.1. Simulations 

For the principal simulation in this study ("L80G"), we 
used a pure dark matter N-body simul ation based on Adap- 
tive Refinement Tree ( ART) code (iKravtsov et al.l 11997b 
iKravtsov & KlvpirJ [l999). The simulation assumed flat, con- 
cordance ACDM (Q, M = 0.3, D, A = 0.7, h = 0.7, and <r g = 0.9) 
and included 512 3 particles in a cubic box with periodic 
boundary conditions and comoving side length 80/r'Mpc. 
These parameters correspond to a particle mass resolution of 



w 3.2 x lO^'M©. For this simulation, the ART code be- 
gins with a spatial grid size of 512 3 ; it refines the grid up to 
eight times in locally dense regions, leading to an adaptive 
distance resolution of w 1.2/r'kpc (comoving units) in the 
densest parts and w 0.31/r'Mpc in the sparsest parts of the 
simulation. 

In this simulation, halos and subhalos were identified 
using a variant of the Bound Density Maxima algorithm 
(Klvpi net al.ll 19991) . Halo centers are located at peaks in the 
density field smoothed over a 24-particle SPH kernel (for a 
minimum resolvable halo mass of 7.7 x 10 9 /i _1 Mq). Nearby 
particles are classified as bound or unbound in an iterative 
process; once all the locally bound particles have been found, 
halo parameters such as the virial mass M wu and maximum 
circul ar velocity V max may be calculated. (See lKravtsov et all 
2004 for complete details on the algorithm). The simulation 
is complete down to V max « 100 km s" 1 , corresponding to a 
galaxy stellar mass of 10 8 - 75 M Q atz = 0. 

The ability of L80G to track satellites with high mass and 
force resolution gives it several uses. Merger trees from L80G 
inform our prescription for converting analytical central-only 
halo mass func tions t o mass functions which include satel- 
lite halos (see 93.2.21 ). Additionally, the merger trees allow 
for evaluation of different models of satellite stellar evolution 
with full consistency (see 93.3.21 1. Finally, the knowledge of 
which satellite halos are associated with which central halos 
allows for estimates of the total stellar mass (in th e cent ral and 
all satellite galaxies) — halo mass relation (see 94.3.6b . 

We also make use of a secondary simulation from the 
Large Suite of Dark Matter Simulations (LasDamas Project, 
http://lss.phy.vanderbilt.edu/lasdamas/) in our sample vari- 
ance calculations. The L80G simulation is too small for use in 
calculating the sample variance between multiple independent 
mock surveys, but the larger size of the LasDamas simulation 
(420 h~ l Mpc, 1400 3 particles) makes it ideal for this purpose. 
However, the LasDamas simulation has poorer mass resolu- 
tion (a minimum particle size of 1.9 x 10 9 M Q ) and force 
resolution (8 h~ l kpc), making it unable to resolve subhalos 
(particularly after accretion) as well as L80G. The LasDamas 
simulation assumes a flat, ACDM cosmology (17^ = 0.25, 
17a = 0.75, h = 0.7, and erg = 0.8) which is very close to the 
WMAP5 best-fit cosmology dKomatsu et al.l2009l) . Collision- 
less gr avitational evo lution was provided by the GADGET-2 
code (Springel 2005). Halos are identified using friends of 
friends with a l inking length of 0.164. The subfind algorithm 
Springel (2005) is used to identify substructure. 

As mentioned, the primary use of the LasDamas simulation 
is in sampling the halo mass functions in mock surveys to 
model the effects of sample variance on high-redshift pencil- 
beam galaxy surveys. The mo ck surveys are constructed s o 
as to mimic the observations in lPerez-Gonzalez et al.l (2008). 
In each mock survey, three pencil-beam light cones (matching 
the an gular sizes of the three fields in Perez-Gonzal ez et al.1 
2008) with random orientations are sampled from a random 
origin in the simulation volume out to z = 1.3. Thus, by com- 
paring the halo mass functions in individual mock surveys to 
the mass function of the ensemble, the effects of sample vari- 
ance may be calculated with full consideration of the correla- 
tions between halo counts at different masses. 

3.2.2. Analytic Mass Functions 

The analytic mass functions of lTinker et al.l (120081) are used 
to calculate the abundance of halos in several cosmological 
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Figure 1. Differences between the fitted Schechter function parameters for the satellite halo mass (at accretion) function and the central halo mass function, as 
a function of scale factor; e.g., Alog I0 c/>o corresponds to log 10 ((/>o sats /0o.centials)- The black lines are calculated from a simulation using WMAP1 cosmology 
(L80G), and the red lines represent the fits to the simulation results in Equation[7j 



models. We calculate mass functions defined by M v ; r , using 
the overdensity specified by iBryan & Normanl ([1998) This 
results in an overdensity (compared to the mean background 
density) A v „ which ranges from 337 at z = to 203 at z = 1 and 
smoothly approach es 180 at very high redshifts. Following 
iTinker et alj ([2008), we use spline interpolation to calculate 
mass functions for overdensities between the discrete inter- 
vals presented in their p aper. 

The mass functions in Ti nker et alJ (120081) only include cen- 
tral halos. We model the small («20% at z = 0) correction to 
the mass function introduced by subhalos to first order only, 
as the overall uncert ainty in the central halo mass function is 
already of order 5% (Tin ker et al.l20 08). In particular, we cal- 
culate satellite (mass at accretion) and central mass functions 
in our simulation (L80G) and fit Schechter functions to both, 
excluding halos below our completeness limit (10 10 3 M ). 
Then, we plot the difference between the Schechter param- 
eters (the difference in characteristic mass, Alog.QM*; the 
difference in characteristic density, Alog 10 ^>o; and the dif- 
ference in faint-end slopes, Aa) as a function of scale factor 
(a). This gives the satellite mass function (<fi s ) as a function 
of the central mass function (<j> c ), which allows us to use this 
(first-order) correction for central mass functions of different 
cosmologies: 

t s (M)=10^° (—^—^ A ° MM/10*****). 

_ (6) 

From our simulation, we find fits as shown in Figure [To 

Alog 10 <7io(a) = -0.736-0.213fl, 

Alog 10 M (fl) = 0.134-0.306fl, (7) 

Aa(a) = -0.306 + 1 .08 a - 0.570 a 2 . 

The mass function used here may be be easily replaced by an 
arbitrary mass function, as detailed in AppendixlAl 

3.2.3. Modeling Uncertainties in Cosmological Parameters 

Our fiducial results are calculated assuming WMAP5 cos- 
mological parameters. In order to model uncertainties in cos- 
mological parameters, we have sampled an additional 100 
sets of cosmological parameters from the WMAP5+BAO+SN 

3 A vil = (18tt 2 + 82x-39.t 2 )/(1 +x); x = (1 + P k(z)I Pm(z)T 1 - 1 

4 Comparin g these fits to satellite mass functions from a more recent sim- 
ulation i Klvpin et al. 2010, the "Bolshoi" simulation), we have verified that 
applying these fits to mass functions for the WMAP5 cosmology introduces 
errors only on the level of 5% in overall number density, similar to the uncer- 
tainty with which the mass function is known. 



MCMC chains (from the models in iKomatsu etaD 120091) 
and generated mass functions for each one according to the 
method in the previous section. Hence, to determine the vari- 
ance in the derived stellar mass - halo mass relation caused by 
cosmology uncertainties, we recalculate the relation for each 
sampled mass function according to the method described in 
AppendixlAl 

3.2.4. Estimating Sample Variance Effects for the Stellar Mass 

Function 

Large-scale modes in the matter power spectrum imply that 
finite surveys will obtain a biased estimate of the number den- 
sities of galaxies and halos as compared to the full universe. 
That is to say, matching observed GSMFs measured from a 
finite survey to the halo mass function estimated from a much 
larger volume will introduce systematic errors into the result- 
ing SM-HM relation. These errors cannot be corrected unless 
one has knowledge of the halo mass function for the specific 
survey in question, which is in general not possible. 

However, we can still calculate the uncertainties introduced 
by the limited sample size. While we cannot determine the 
true halo mass function for the survey, we can calculate the 
probability distribution of halo mass functions for identically 
shaped surveys via sampling lightcones from simulations. If 
we rematch galaxy abundances from the observed GSMF to 
the abundances of halos in each of the sampled lightcones, 
then the uncertainty introduced by sample variance is exactly 
captured in the variance of the resulting SM-HM relations. 

In detail, we create our distribution of halo mass func- 
tions by sampling one t housan d mock surveys from the Las- 
Damas simulation (see 33.2. U corresponding to the e xact sur- 
vey parameters used in iPerez- Gonzalez et alj (120081) . We fit 
Schechter functions to the halo mass functions of each mock 
survey (over all redshifts), and we calculate the change in 
Schechter parameters (Alog 10 (/Jo, Alog 10 Mo, and Aa) as 
compared to a Schechter fit to the ensemble average of the 
mass functions. Using the distribution of the changes in 
Schechter parameters, we may mimic to first order the ex- 
pected distribution of halo mass functions for any cosmology. 
In particular, we use an equation exactly analogous to Equa- 
tion|6]to convert the mass function for the full universe C^fuii) 
and the distribution of Alog 10 <75o, Alog 10 Mo, and Aa into a 
distribution of possible survey mass functions (</> bs): 

obs (M)=lO A1 °^o ^ _Jl__ j % fun(M/10 A,o gl0 M„ x 

(8) 

Hence, to obtain the variance in the stellar mass - halo mass 
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relation caused by finite survey size, we recalculate the rela- 
tion for each one of the survey mass functions thus computed 
according to the method described in AppendixlAl 

3.3. Uncertainties in Abundance Matching 

3.3.1. Scatter in Stellar Mass at Fixed Halo Mass 

An important uncertainty in the abundance matching pro- 
cedure is introduced by intrinsic scatter in stellar mass at a 
given halo mass. Suppose that M*(M^) is the average (true) 
galaxy stellar mass as a function of host halo mass. For a 
perfect monotonic correlation between stellar mass and halo 
mass, i.e., without scatter between stellar and halo mass, it 
is straightforward to relate the true or "intrinsic" stellar mass 
function (<f) tme ) to the halo mass function (0/,) via 



dN 



dN dlogM h 



d\og 1Q M* d\og l0 M h c/logM* 
where N is the number density of galaxies, so that 



4>tm e (M*(M h )) = (f> h (Mh) 



d\ogM*{M h ) 
d\ogM h 



(9) 



(10) 



Intuitively, as the halos of mass M/, get assigned stellar masses 
of M*(M/,), the number density of galaxies with mass M*(M/,) 
will be proportional to the number density of halos with mass 
M/,. The above equations are simply a mathematical represen- 
tation of the traditional abundance matching technique. 

Equation[lO]remains useful in the presence of scatter. If we 
know the expected scatter about the mean stellar mass, say in 
the form of a probability density function P v (Alog 10 M* |M/,), 
then we may still relate </> trae to 0/, via an integral similar to a 
convolution: 

</>true (X) = J 4>h(Mh(M*)) —± X 

u dlogM* 

xP s {log lQ ^-\M h {M,))dlog lQ M„ (11) 

where M/,(M») is the inverse function of M*(M/,). 

This similarity to a convolution is no coincidence — 
mathematically, it is analogous to how we mo del ran dom sta- 
tistical errors in stellar mass measurements in 33.1.21 Namely, 
if we define ^direct to equal the right-hand side of EquationfTUl 



6direct(M») = (t> h (M h (M*)) 



dlogM h 
c/logM* 



(12) 



and if we assume a probability density distribution indepen- 
dent of halo mass (i.e., scatter in stellar mass at fixed halo 
mass is independent of halo mass), then <fi lme is exactly re- 
lated to indirect by a convolution: 



e(M») : 



£dtot(10V,(y-log 10 M*>/)>, (13) 



which is mathematically identical to Equation|2]in 33.1.21 

Then, if one calculates indirect from </> true , one may find 
M/,(M„) via direct abundance matching. Namely, integrating 
equation fT2l we have: 

4> h {M)d\0g w M = / (^directtAQflflog^M*. (14) 

M k (M,) JM, 

Equivalently, letting 3>/,(M/,) = J™ (f>i,(M)d log 10 M be the 
cumulative halo mass function, and letting <£> direct (M*) = 



</ > direct(^*)^log 10 M» be the cumulative "direct" stellar 
mass function, we have 



M, I (M,)=$7 1 ($ dl rect(M,)), 



(15) 



and one may similarly find M„(M/,) by inverting this relation. 

Our approach in all equations except for Equation [13] al- 
lows a halo mass-dependent scatter in the stellar mass, but to 
date the data appears to be consistent with a constant scatter 
val ue. For example, using the kinematics of satellite galax- 
ies, iMore et ail d2009l) finds that the scatter in galaxy lumi- 
nosity at a given halo mass is 0.16 ±0.04 dex, i ndependent 
of halo mass. Using a catalog of galaxy groups, lYang et aTl 
(2009b) find a value of 0.17 dex for the scatter in the stel- 
lar mass at a given halo mass, also independent of halo mass. 
Here, we thus assume a fixed value for the scatter in stellar 
mass at fixed halo mass, £, to specify the sta ndard deviation 
of P t (Alog, n M*). As the 



Yanget al. (2009b) value is consis- 



ts 

tent with the More et al. (2009) value, we set the prior using 



the IMore et alj (2009) value and error bounds on £, We as- 
sume a Gaussian prior on the probability distribution for £, 
and we assume that the scatter itself is log-normal. 

3.3.2. The Treatment of Satellites 

When a galaxy is accreted into a larger system, it will likely 
be stripped of dark matter much more rapidly than stellar mass 
because the stars are much more tightly bound than the halo. 
It has been demonstrated that various galaxy clustering prop- 
erties compare favorably to samples of halos where satellite 
halos — i.e., subhalos — are selected according to their halo 
mass at the ep och of accretion, M acc , rather than their cur- 
rent mass (e.g..]Nagai & Kravtsovll2005t IConrov et al.ll2006l: 
IVale & Ostr iker 2006; Berrier et al. 2006]). These results sup- 
port the idea that satellite systems lose dark matter more 
rapidly than stellar mass. 

As commonly implemented (e.g. IConrov et al J 120061) . the 
abundance matching technique matches the stellar mass func- 
tion at a particular epoch to the halo mass function at the same 
epoch, using M acc rather than the present mass for subhalos. 
As M acc remains fixed as long as the satellite is resolvable, the 
standard technique implies that the satellite galaxy's stellar 
mass will continue to evolve in the same way as for centrals 
of that halo mass. Therefore, a subtle implication of the stan- 
dard technique is that satellites may continue to grow in stellar 
mass, even though M acc remains the same. A different model 
for satellite stellar evolution (e.g., in which stellar mass which 
does not evolve after accretion) would therefore involve dif- 
ferent choices in the satellite matching process. 

The fiducial results presented here use the standard model 
where satellites are assigned stellar masses based on the cur- 
rent stellar mass function and their accretion-epoch masses. 
However, we also present results for comparison in which 
satellite masses are assigned utilizing the stellar mass func- 
tion at the epoch of accretion, corresponding to a situation in 
which satellite stellar masses do not change after the epoch 
of accretion. In order to maintain self-consistency for the lat- 
ter method, we use full m erger trees (from L80G, the simu- 
lation described in 33.2. j} to keep track of satellites and to 
assure that, e.g., mergers between satellites before they reach 
the central halo preserve stellar mass. 

Finally, we note that any specific halo-finding algorithm 
may introduce artifacts in the halo mass function in terms 
of when a satellite halo is considered absorbed/destroyed. 
This can have a small effect on satellite clustering as well as 
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number density counts. Wet zel & White! d2009) suggest an 
approach that avoids some of the problems associated with 
resolving satellites after accretion. Namely, they suggest a 
model where satellites remain in orbit for a duration that is a 
function of the satellite mass, the host mass, and the Hubble 
time, after which time they dissolve or merge with the cen- 
tral object. Although we have not modeled this explicitly, our 
satellite counts are consistent with their recommended cutoff 
— they suggest considering a satellite halo absorbed when its 
present mass is less than 0.03 times its infall mass; in our sim- 
ulation, only 0. 1 % of all satellites fall below this threshold. 

3.4. Functional Forms for the Stellar Mass - Halo Mass 
Relation 

In order to determine the probability distribution of our un- 
derlying model parameters, we must first define an allowed 
parameter space for the stellar mass - halo mass relation. Ide- 
ally, one would like a simple, accurate, physically intuitive, 
and orthogonal parameterization; in practice, we seek the best 
compromise with these four goals in mind. We consider one 
of the most popular methods for choosing a functional form 
(indirect parameterization via the stellar mass function) be- 
fore discussing the method we use in this paper (parameteri- 
zation via deconvolution of the stellar mass function). 

3.4.1. Parameterizing the Stellar Mass Function 

In abundance matching, knowledge of the halo mass func- 
tion and the stellar mass function uniquely determines the 
stellar mass - halo mass relation. Hence, parameterizing 
the stellar mass function yields an indirect parameterization 
for the stellar mass - halo mass relation as well. Numer- 
ous papers (e.g.lCole et al.l200UlBell et alj2003HPanter et alj 
12004 iPerez-Gon zalez et al. 2008) have found that the GSMF 
is well-approximated by a Schechter function: 

/ M \~ a(z) f M \ 

^=^\m) exp U(i))' (16) 

where the Schechter parameters </>*(z), M(z), and a(z) evolve 
as functions of the redshift z. In many previ ous works 
on abundance matching (e.g. iConrov et al.1 12009b . it is the 
Schechter function for the stellar mass function that sets the 
form of the SM-HM relation. 

More recently, however, several authors have noted that 
the GSMF cannot be matched by a single Schechter function 
for z < 0.2 to within statistical errors (e.g. lLi& Whlte1l2009l: 
Baldry et al. 2008), in part because of an upturn in the slope 
of the GSMF for galaxies below 10 9 M Q in stellar mass. It 
is possible that a conspiracy of systematic errors causes the 
observed deviations, but there is no fundamental reason to ex- 
pect the intrinsic GSMF to be fit exactly by a Schechter func- 
tion (see discussion in Appendix P. In any case, our full pa- 
rameterization — either the stellar mass function or the error 
parameterization — must be able to capture all the subtleties 
of the observed stellar mass function. Hence, we are inclined 
to adopt a more flexible model than the Schechter function of 
equation[l6] Other authors, wrestling with the same problem, 
have chosen to adopt multiple Schechter functions, includ- 
ing the ele ven-parameter tripl e piecewise Schechter-function 
fit used bv lLi & White! (12009). While accurate, these models 
often add complication without increasing intuition. 

3.4.2. Deconvolving the Observed Stellar Mass Function 




11 12 13 14 15 
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Figure 2. Relation between halo mass and stellar mass in the local Universe, 
obtain ed via direct deconvolution of the stellar mass function in lLi & White! 
12009) matched to halos in a WMAP5 cosmology. The deconvolution in- 
cludes the most likely value of scatter in stellar mass at a given halo mass as 
well as statistical errors in individual stellar mass es. The direct deconvolution 
(solid line) is compared to the best fit to Ea. \21\ (red dashed line). 

Rather than attempting to parameterize the stellar mass 
function, we could use abundance matching directly to de- 
rive the stellar mass - halo mass relation for the maximal- 
likelihood stellar mass function, and then find a fit which can 
parameterize the uncertainties in the shape of the relation. 
This process is complicated by the various errors which we 
must take into account. Recall from Equations l2land[T3l that 

0mea S (M») = (/"direct (M*) oP s (AlOg 10 M*) O^Alog^M,), 

(17) 

(where "o" denotes the convolution operation, P s is the prob- 
ability distribution for the scatter in stellar mass at fixed halo 
mass, and P is the probability distribution for errors in ob- 
served stellar mass at fixed true stellar mass). However, 
if we obtain ^direct by deconvolution of the observed stellar 
mass function mea s, we may use direct abundance matching 
( Equation [T5ll to determine the maximum likelihood form of 
M h (M*). 

Figure |2] shows the result of calculating M/,(M») at z ~ 0.1 
via deconvolution and direct matching of the stellar mass 
function as described in the previous section. We choose 
the maximum-likelihood value for the distribution function P s 
(namely, 0.16 dex log-normal scatter), and we use WMAP5 
cosmology for the halo mass function </>/, in the derivation. 

While deconvolution plus direct abundance matching gives 
an unbiased calculation of the relation, there are several prob- 
lems which prevent it from being used directly to calculate 
uncertainties: 

1 . Deconvolution will tend to amplify statistical variations 
in the stellar mass function — that is, shallow bumps 
in the GSMF will be interpreted as convolutions of a 
sharper feature. 

2. Deconvolution will give different results depending on 
the boundary conditions imposed on the stellar mass 
function (i.e., how the GSMF is extrapolated beyond 
the reported data points) — the effects of which may be 
seen at the edges of the deconvolution in Figure [2] 

3. Deconvolution becomes substantially more problem- 
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atic when the convolution function varies over the red- 
shift range, as it does for our higher-redshift data (z > 
0.2). 

4. Deconvolution cannot extract the relation at a single 
redshift — instead, it will only return the relation aver- 
aged over the redshift range of galaxies in the reported 
GSMF. 

For these reasons, we choose to find a fitting formula in- 
stead. In the discussion that follows, we fit M^(M«) (the halo 
mass for which the average stellar mass is M*) rather than the 
more intuitive (M/,) (the average stellar mass at a halo mass 
M/,) primarily for reasons of computational efficiency. From 
Equations[l2]and[T7j the calculation of what observers would 
see (0 m eas) for a trial stellar mass - halo mass relation requires 
many evaluations of M/^M*) and no evaluations of M^M/,). 
If we had instead parameterized M^M/,), and then inverted as 
necessary in the calculation of <fi me - ds , our calculations would 
have taken an order of magnitude more computer time. 

3.4.3. Fitting the Deconvolved Relation 

It is well-known from comparing the GSMF (or the lu- 
minosity function) to the halo mass function that high-mass 
(M* > 10 10 5 Mq) galaxies have a significantly different stel- 
lar mass-halo mass scaling than low-mass galaxies, which 
is usually attributed to different feedback mechanisms dom- 
inating in high-mass vs. low-mass galaxies. The transition 
point between low-mass and high-mass galaxies — seen as a 
turnover in the plot of M/,(M„) around = 10 10 6 M in Fig- 
ure |2] — defines a characteristic stellar mass (M* o) and an as- 
sociated characteristic halo mass (Mi). Hence, we consider 
functional forms which respect this general structure of a low 
stellar mass regime and a high stellar mass regime with a char- 
acteristic transition point: 

log 10 (M/,(M*)) = log 10 (Mi) [Characteristic Halo Mass] 
+fiow(M le /Mi e p) [Low-mass functional form] 
+//„ g /,(M„/M >t ,o) [High-mass functional form] 

where fi ow and are dimensionless functions dominating 
below and above M*.o, respectively. 

For low-mass galaxies (M* < 1O 1O 5 M ), we find the stellar 
mass - halo mass relation to be consistent with a power-law: 

Mi ~\M*.o) '° r 
\og{M h {M*))^\og{M x ) + f3\ g . (18) 

For high-mass galaxies, we find the stellar mass-halo mass 
relation to be inconsistent with a power-law. In particu- 
lar, the logarithmic slope of M/,(M„) changes with M*, with 
(ilogM/j/iilogM* always increasing as M* increases. This 
may seem like a small detail; after all, by eye, it appears that 
a power law could be a reasonable fit for high-mass galax- 
ies in F igure [2] In addition, because previous authors (e.g., 
Mos ter et al.ll2009t lYang et al.ll2009al) have used power laws, 
it may not seem necessary to use a different functional form. 

In order to explore this issue, we tried a general dou- 
ble power-law functional form for Mh(M^) whi c h pa rame- 
terized a superset o f the fits used in iMoster et alj d2009f) and 
Yan g et al. (2009a) (in particular, the same form as in Equa- 
tion lC2l in AppendixICt. We found that this approach had two 
major problems common to any such power-law form: 



1. As the logarithmic slope of M/,(M*) increases with 
increasing M„, the best-fit power-law for high-mass 
galaxies will depend on the upper limit of M* in the 
available data for the GSMF. Thus, the best-fit power- 
law will depend on the number density limit of the 
observational survey used — rather than on any funda- 
mental physics. Moreover, for studies such as this one 
which consider redshift evolution, the different number 
densities probed at different redshifts result in a com- 
pletely artificial "evolution" of the best-fit power-law. 

2. The best-fit power-law will not depend on the high- 
est mass galaxies alone; instead, it will be something 
of an average over all the high-mass galaxies. Because 
the logarithmic slope is increasing with M*, this means 
that the best-fit power law for M/,(M») will increasingly 
underestimate the true M/,(M*) at high M*. Namely, 
the fit will underestimate the halo mass correspond- 
ing to a given stellar mass, and therefore (as lower- 
mass halos have higher number densities) result in stel- 
lar mass functions systematically biased above observa- 
tional values. However, a systematic bias in our func- 
tional form will influence the best-fit values of the sys- 
tematic error parameters. The systematic bias caused 
by assuming a power-law form turns out to be most de- 
generate with the scatter in stellar mass at fixed halo 
mass (£). As a result, for the MCMC chains which as- 
sumed a double power-law form for Mh(M^), the pos- 
terior distribution of £ was 0.09±0.02 dex, which just 
barely lies within 2a of the constraints from Mor e et al.l 
(120091) . 

These problems are not as significant if one only considers 
the stellar mass function at a single redshift, or if one does not 
allow for the systematic errors which change the overall shape 
of the stellar mass function (k, £, and <r(z)). However, we find 
that the issues listed above exclude the use of a power-law 
for our purposes. Instead, we find that Mi,(M*) asymptotes 
to a sub-exponential function for high M*, namely, a function 
which climbs more rapidly than any power-law function, but 
less rapidly than any exponential function. We find that high- 
mass galaxies (M* > 10 10 5 Mq) are well fit by the relation 

M h {M*) oc lofe) , or 

log 10 (M A (M»))->log 10 (Mi) + (M^-) (19) 

\M*fi J 

where 8 sets how rapidly the function climbs; 8 — » would 
correspond to a power-law, and 8=1 would correspond to a 
pure exponential. Typical values of <5 at z = range from 0.5 - 
0.6. It is not obvious what physical meaning can be directly 
inferred from the choice of a sub-exponential function — after 
all, the stellar mass of a galaxy is a complicated integral over 
the merger and evolution history of the galaxy — but it could 
suggest that the physics driving the Mh(M^) relation at high- 
mass is not scale-free. 

Although this form now matches the asymptotic behavior 
for the highest and lowest stellar mass galaxies, one addi- 
tional parameter is necessary to match the functional form of 
the deconvolution. That is to say, galaxies in between the ex- 
tremes in stellar mass will lie in a transition region, as they 
may have been substantially affected by multiple feedback 
mechanisms. The width of this transition region will depend 
on many things — e.g., how long galaxies take to gain stellar 
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mass, how much of the stellar mass present came from quies- 
cent star formation as opposed to mergers, and the degree of 
interaction between multiple feedback mechanisms. Hence, 
instead of having M/,(M*) become suddenly sub-exponential 
for galaxies larger than M»,o, we allow for a slow "turn-on" of 
the more rapid growth. The behavior of M/,(M*) is best fit by 
modifying the previous equation to 



log 10 (Af A (Af*)) log 10 (M 1 ) + ■ 



\M,.o J 



1 + 



\M,,0 J 



(20) 



The denominator, 1 + (M*/M* o)" 7 , is large for M* < Af* o» 
and it falls to unity for M» > Af* o at a rate controlled by 7. A 
larger value of 7 implies a more rapid transition between the 
power-law and sub-exponential behavior (typical values for 
(7) atz = 0are 1.3-1.7). As the non-constant piece of M/,(M„) 
in Equation|20]is 5 forM* =M* ) o, we add a final factor of-^ 
to compensate so that M/,(M*.o) = Mi . 

To summarize, our resulting best-fit functional form has 
five parameters: 

log 10 (M A (M*)) = 



log 10 (M l) + /31og I0 [^V 



1 



Where Mi is a characteristic halo mass, M* o is a characteristic 
stellar mass, /3 is the faint-end slope, and 6 and 7 control the 
massive-end slope. The best fit using this functional form is 
shown in Figure and it achieves excellent agreement over 
the entire range of stellar masses. 

Deconvolving the GSMF at higher redshifts does not sug- 
gest that anything more than linear evolution in the parame- 
ters is necessary, at least out to z = 1 . While the characteristic 
mass of the GSMF and the characteristic mass of the halo 
mass function certainly evolve, the change in the shapes of 
the two functions is relatively slight. As we wish for the func- 
tional form to have a natural extension to higher redshifts, we 
parameterize the evolution in terms of the scale factor (a): 

log 10 (Mi (a)) = Mi , +Mi . a (a - 1 ) , 
log 1Q (A/* ) o(a))=Af* )Q) o+M» i o, fl (a-l) ) 

(3(a) = f3 + f3 a (a-l), (22) 
5(a) = S + S a (a-l), 
7(a) = 7o+7o(a-l), 
where a = 1 is the scale factor today. 

3.5. Calculating Model Likelihoods 

We make use of a Markov Chain Monte Carlo (MCMC) 
method to generate a probability distribution in our com- 
plete parameter space of stellar mass function parame- 
ters (Mi i o,Mi ia ,M* )0> o,M* i o, a ,/3o,/3 a ,6oA,7o,7aX systematic 
modeling errors (K,/i,cr z ), and the scatter in stellar mass at 
fixed halo mass (£). A brief summary of each of these param- 
eters appears in Table [TJ along with a reference to the section 
in which it was first described. Using this full model, we may 
calculate the stellar mass functions expected to be seen by 
observers (Expect) for a large number of points i n parameter 
space, and compare them to observed GSMFs dLi & White! 



l2009HPerez-Gonzalez et al.ll2008l) . Note that, as the observa- 
tional data always covers a range of redshifts, we must mimic 
this in our calculation of < 



/'expect- 



^expect " 



f* <t>&(z)dVc(z) 



(23) 



where dVc(z) is the comoving volume element per unit solid 
angle as a function of redshift. Then, we can write the likeli- 
hood as L = exp (-% 2 /2), where 



X 



log lo [0expect(A/*)/^ 
<T bs(M,) 



£/log 10 (M„), (24) 



and where (T b s (M*) is the reported statistical error in meas as 
a function of stellar mass. 

Note that, as defined above, the equation for x 2 contains 
the assumption that there is only one independent observa- 
tion point for the GSMF per decade in stellar mass (from the 
weight of d\og m (M)). We may tune this assumption introduc- 
ing another parameter n — the number of non-correlated obser- 
vations per decade in stellar mass — which would change the 
likelihood function to C = e\p(-n\ 2 /2) . Here, we a ssume 
that each of the data poin ts reported by iLi & White] d2009l) 
and Pe rez-Gonzalez et ail (120081) are independent — such that 
n = 10 for the former paper and n = 5 for the latter paper. 

The MCMC chains each contain 2 22 « 4 x 10 6 points. 
We verify converg ence according to the algorithm in 
Dunkl ev et al.l d2005l) : in all cases, the ratio of the sample 
mean variance to the distribution variance (the "convergence 
ratio") is below 0.005. 

3.6. Methodology Summary 

Our procedure to calculate the stellar mass - halo mass re- 
lation, taking into account all mentioned uncertainties, may 
be summarized in seven steps: 

1 . We select a trial point in the parameter space of SM- 
HM relations as well as a trial point in our parameter 
space of systematics (/1, k, a z , £). A complete list of 
parameters and descriptions is given in Table [T] 

2. The trial SM-HM relation gives a one-to-one mapping 
between halo masses and stellar masses, giving a di- 
rect conversion from the halo mass function to a trial 
galaxy stellar mass function (corresponding to direct in 
CHIT) . 

3. This trial GSMF is convolved with the probability dis- 
tributions for scatter in stel lar mass at fixed halo mass 
(controlled by £, see 33.3.11 ) and for scatter in observer- 
determined stellar mass at fixed t rue stellar mass (par- 
tially controlled by a z , see 33.1.2b . 

4. The resulting GSMF is shifted by a uniform offset in 
stellar masses (controlled by p.) to account for uni- 
form systematic differences between our adopted stellar 
masses and the true underlying masses. Also, its shape 
is stretched or compressed to account for stellar mass- 
dependent offsets between our masses and th e true un- 
derlying masses (controlled by k, see 33.1.11 ). 

5. We repeat steps |2]|4] for all redshifts in the range cov- 
ered by the observed data set. We may then calculate 
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Table 1 

Summary of Model Parameters 



Symbol 


Description 


Prior" Section 


M,,(M„) 


The halo mass for which the average stellar mass is M„ 


N/A 


3.4.3 




Mi 


Characteristic Halo Mass 


Flat (Log) 


3.4.3 




M*, 


Characteristic Stellar Mass 


Flat (Log) 


3.4.3 




P 


Faint-end power law (M/, ~ ) 


Flat (Linear) 


3.4.3 




8 


Massive-end sub-exponential (log| (M/,) ~ M^) 


Flat (Linear) 


3.4.3 




7 


Transition width between faint- and massive-end relations 


Flat (Linear) 


3.4.3 




too 


Value of the variable ( v) at the present epoch, where (.v) is one of (Mi ,M„ o 


/3,5,7) (see above) 


3.4.3 




(x) a 


Evolution of the variable (x) with scale factor 


(same as for (jc)o) 


3.4.3 




M 


Systematic offset in M* calculations 


G(0,0.25) (Log) 


3.1.1 




K 


Systematic mass-dependent offset in M* calculations 


G(0,0.10) (Linear) 


3.1.1 






Redshift scaling of statistical errors in M* calculations 


G(0.05. 0.015) (Log) 


3.1.2 





| Scatter in M* at fixed M h G(0. 16, 0.04) (Log) [3TTT 



a See Equations [Tl Bl |2~T|23 1 G(x,s) denotes a Gaussian prior centered at x with standard deviation s, in either linear or logarithmic 
units. 'Flat' denotes a uniform prior in either linear or logarithmic units. 



the expected GSMF in each redshift bin for which ob- 
servers have reported data. The likelihood of the ex- 
pected GSMFs given the measured GSMFs is then used 
to determine the next step in the MCMC chain. 

6. To account for sample variance in the observed stel- 
lar mass functions above z ~ 0.2, we recalculate each 
SM-HM relation in the chain for an alternate halo mass 
function taken from a randomly sampled mock survey 
(see 33.2.4b and re-fit our functional form to the red- 
shift evolution of the relation. Similarly, for the results 
which include cosmology uncertainty, we recalculate 
each SM-HM relation for an alternate halo mass func- 
tion randomly selected from the MCMC chain used to 
determine the WMAP5 cosmology uncertainties. 

7. We repeat steps QJ6] to build a joint probability distri- 
bution for the SM-HM relation and the systematics pa- 
rameter space. The steps are repeated until the joint 
probability distribution has converged to the underlying 
posterior distribution. 

4. RESULTS FOR < z < 1 

We now present the results of this approach to d etermine 
the SM-HM relation and related quantities. In 34.11 we com- 
pare GSMFs generated from our best fits to observed data and 
comment on the effects of systematic observational biases. 
We present our best-fit ting results for the SM-HM relation 
with full error bars in 34.21 We evaluate the relative impor- 
tance of each of the contributing types of err or in 34.31 and 
summarize the most relevant contributions in 34.3.71 Finally, 
our derive d SM -HM relation is compared to other published 
results in 3441 

4. 1 . Galaxy Stellar Mass Functions 

To demonstrate that our functional form for M^(M*) is ca- 
pable of reproducing observed galaxy stellar mass functions, 
we show a comparison between our best-fit models and the 
observed data in Figs. [3] and [4] at several redshifts. For our 
best-fit models, both (f> lml , (the true or "intrinsic" stellar mass 
function) and </> m eas ( tne GSMF that observers would measure) 
are shown. Recall that </> meas incorporates the effects of the 
systematic observational biases; namely, the overall shift in 
stellar mass calculations, /i, the linearly mass-dependent shift, 
k, and the statistical errors in stellar mass calculations for in- 
dividual galaxies, er(z). The fact that the best-values of the 



systematic parameters (/i, k, £, er,) are very close to the cen- 
ters of their prior distributions provides confirmation th at the 
functional form for the SM-HM relation outlined in 33.4.31 
does not bias our best-fit results. 

As our best-fit values for fi and n are close to zero (see 
Table O, the difference between 4> tme and meas is almost ex- 
clusively due to the scatter a(z) in calculated stellar masses. 
The difference between (f> tme and </W as only becomes evident 
for galaxies above 10 11 M Q , where the falling slope of the 
GSMF becomes severe enough for the scatter a(z) to signifi- 
cantly raise number counts in the observed GSMF. At z ~ 0, 
the systematic effect of er(z) puts the intrinsic GSMF well be- 
low the small statistical error bars. 

At higher redshifts, although the effect of cr(z) is larger, 
current surveys at z > 0.2 do not yet cover sufficient volume 
to constrain the shape of the GSMF well at the massive end. 
Nonetheless, for future wide-field surveys at z > 0.2, correc- 
tion to the GSMF for scatter in calculated stellar masses will 
be an important consideration. 

4.2. The Best-Fit Stellar Mass - Halo Mass Relations 

We plot the average stellar mass as a function of halo mass 
for z = — 1 in Figure [5] to show the evolution of the stellar 
mass - halo mass relation. Note that as the stella r mass at a 
given halo mass has a log-normal scatter (see £12.31) . we use ge- 
ometric averages for stellar masses rather than linear ones. To 
highlight the effects of halo mass on star formation efficiency, 
we also present the SM-HM relation in terms of the average 
stellar mass fraction (stellar mass / halo mass) for z = - 1 
as a function of halo mass in the same figure. We focus on 
this quantity for the remainder of the paper. The best-fit pa- 
rameters for the function M/,(M*) are given in Table |2j and 
the numerical values for the stellar mass fractions are listed in 
AppendixiDl 

The stellar mass fractions for central galaxies consistently 
show a maximum for halo masses near 10 12 M . While the 
location of this maximum evolves with time, it clearly il- 
lustrates that star-formation efficiency must fall off for both 
higher and lower-mass halos. The slopes of the SM-HM re- 
lation above and below this characteristic halo mass are in- 
dicative of at least two processes limiting star-formation effi- 
ciency, although mergers complicate direct analysis for high- 
mass halos. At the low-mass end, the SM-HM relation scales 
as M* ~ Ml 3 at z = and as ~ M 2,9 at z = 1 . However, 
given the lack of information about low stellar-mass galaxies 
at z > 0.5, the statistical significance of this evolution is weak; 
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Figure 3. Comparison of the best fit (fitrue (the true or "intrinsic" GSMF) 
in our model to the resulting </> mca s (what an observer would report for the 
GSMF, which includes the effects of the systematic biases fj,. re, and a) at 
Z = 0. Since the best-fit values of fj, and re are very close to zero, the difference 
between mMS and <f> tme almost exclusively comes from the uncertainty in 
measuring stellar masses (<r). 



Table 2 

Best fits for the redshift evolution of M/,(M» ) 



Parameter Free (/^, re) 
0<z.< 1 



< 



i = 

:< 1 



Free (^t, re) 
0.8<z<4 



M„,o,o 


10 72+0' 22 


10 72+0- 02 


1 1 09 +0 - 54 
X1, -0.31 




55 +0 - 18 


59 +0 - 15 


56 m89 
U - JO -0.44 




N/A 


N/A 


6 99+ 2 69 






i 2 ic+0.02 


12.27^f 


Mi, a 


0.28$? 


10+°- 14 




A) 




0.43^1 


n <;<;+0.26 
°- 65 -0.20 






0.18*1 


3 1+0-38 




57+0-15 




56+ 133 




17+0-42 


o.i¥jj* 


-0 12+0- 76 
u - lz -0.50 


To 


1 56^- 12 


, 54+0.03 
1 ■ -0.40 


1 12+ 7 - 47 


7« 


2 SI""- 15 


9 S9+0.03 
i-J -1.89 


-o <n+ 7 - 87 


M 


00 m24 
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no evolution in the low-mass slope of the relation is consistent 
within our one-sigma errors. Several studies (most recently, 
iBaldry etalll2008t iDrory et all 120091) have reported that the 
GSMF has an upturn in slope for very low stellar masses, 
particularly below 1O 8,5 M ; this would imply that our best 
fits may overestimate the scaling relation for galaxies below 
10 8 ' 5 Mq. At the high mass end, our best fitting function re- 
sults in a progressively shallower relation for the growth of 
stellar mass with halo mass, so that no single power law can 
describe the scaling. However, for halos close to 10 I4 M Q , 
the best-fit relation scales locally as M* ~ 28 at z = and 
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Figure 4. Comparison of the best fit t me (the true or "intrinsic" GSMF) to 
the resulting </> me as (as in Figure[5J, for z. = 0.5 and z = 1.15. Statistical errors 
in individual stellar masses have a larger effect at higher redshift, resulting in 
a steeper intrinsic bright end than measured. 

~ M°- 34 at z = 1, in accord with previous studies (see 34.41 ). 
The results for high mass halos are also consistent with no 
evolution in the slope of the SM-HM relation. 

Figure [6] shows the stellar mass fraction for < z < 1 ex- 
cluding the effects of systematic shifts in stellar mass calcu- 
lations (i.e., assuming \i = k = 0). Under the assumption that 
systematic errors in stellar mass calculations result in similar 
biases in stellar masses at z = as they do at higher redshifts, 
this allows us to consider the evolution in normalization of the 
SM-HM relation. Low-mass halos (below 10 12 M ) display 
clearly higher stellar mass fractions at late redshifts than they 
do at early redshifts. By contrast, the evolution in stellar mass 
fractions for high mass halos (above 10 13 5 M ) is not statis- 
tically significant, and it is constrained to be substantially less 
than for low-mass halos. In the time since z = 1, this means 
that the star formation rates for high-mass halos typically fall 
relative to their dark matter accret ion rates, whereas the op - 
posite is true for low-mass halos (IConrov & W echsler 2009). 
The best-fitting parameters for the SM-HM relation assum- 
ing [i = k = appear in Table [2] and the data points in Figure 
[6] appear in Appendix iDl 

4.3. Impact of Uncertainties 
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Figure 6. Evolution of the derived stellar mass fractions (M„ /M/,) in the 
absence of systematic errors. This result is analogous to Figure [5] bottom 
panel, calculated under the assumption that the true values of the systematics 
fjj and re in the stellar mass function are zero at all redshifts. 
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Figure 5. Top panel: Stellar mass - halo mass relation as a function of red- 
shift for our preferred model. Bottom panel: Evolution of the derived stellar 
mass fractions (A/./Mj,). In each case, the lines show the mean values for 
central galaxies. These relations also characterize the satellite galaxy pop- 
ulation if the horizontal axis is interpreted as the halo mass at the time of 
accretion. Errors bars include both systematic and statistical uncertainties, 
calculated for a fixed cosmological model (with WMAP5 parameters). 

4.3. 1 . Systematic Shifts in Stellar Mass Calculations 

By far the largest contributor to the error budget of the SM- 
HM relation is the systematic error parameter /i. As the ef- 
fect of /i is to multiply all stellar masses by a constant factor, 
and as the width of the error bars in Figure [^corresponds al- 
most exactly to the prior on /i, we may conclude that reducing 
the error on the systematic shifts in stellar mass calculations 
would represent the single largest improvement in our under- 
standing of the shape of the SM-HM relation. Figure[6]shows 
the substantially smaller error bars that result if systematic er- 
rors (pi and ft) in the stellar mass calculations are neglected. 

4.3.2. Scatter in Stellar Mass at Fixed Halo Mass 

The effect of ignoring scatter in stellar mass at fixed halo 
mass (i.e., setting £ = 0) is shown at two redshifts in Figure[7] 
We find that the change is insignificant below halo masses of 
10 12 Mq, and is within statistical error bars below 10 13 Mq 
for z = 1 ■ This is a result of the fact that the slope of the stellar 
mass function below 10 Mq in stellar mass (correspond- 
ing to 10 12 Mq in halo mass) is not steep enough for scat- 



ter to have significant impact (see also Tasits iomi et al.f 2004). 
Because £ > results in high stellar-mass galaxies being as- 
signed to lower-mass halos than they would be otherwise (due 
to the higher number density of lower-mass halos), the effect 
is that higher-mass halos contain fewer stars on average than 
they would for £ = 0. The effect of setting £ = exceeds sys- 
tematic error bars only for the very highest mass halos, above 
1O 14 - 5 M . 

We note that our posterior distribution constrains £ to be 
less than 0.22 dex at the 98% confidence level. Higher val- 
ues for £ wo uld result in GSM Fs inconsistent with the steep 
fa Hoff of the iLi & White! (120091) GSMF (see also discussion 
in lGuo et alJl2009l) . 

4.3.3. Statistical Errors in Stellar Mass Calculations 

The significance of including or excluding random statisti- 
cal errors in stellar mass calculations, a(z), is also shown Fig- 
ure|7l The effect of this type of scatter on the SM-HM relation 
is mathematically identical to the effect of scatter in stellar 
mass at fixed halo mass. As <j{z = 0) (~ 0.07 dex) is much 
smaller than the expected value of £ (~ 0.16 dex), the con- 
volution of the two effects is only marginally different from 
including £ alone at z = 0; this results in only a minor effect on 
the SM-HM relation. The effect becomes more pronounced 
at z = 1 for the reason that a{z = 1) (~ 0. 12 dex) becomes more 
comparable to £ — and so including the effects of statistical er- 
rors in stellar mass becomes as important as modeling scatter 
in stellar mass at fixed halo mass. 

4.3.4. Cosmology Uncertainties 

In Figure[8] we show a comparison of best fits for the stellar 
mass fraction using abundance matching with three different 
halo mass func tions: analytic prescriptions for WMAP5 and 
WMAP1 (see 33.2.21 ) as well as the mass function taken di- 
rectly from the L80G simulation (see 33.2.11 ). The difference 
between the L80G simulation and the analytic WMAP1 mass 
function is slight, as the L80G simulation uses WMAP1 initial 
conditions (h = 0.7, Q m = 0.3, £1 A = 0.7, cr 8 = 0.9, n s = 1); the 
difference is consistent with sample variance for the relatively 
small (80 h Mpc) size of the simulation. The difference be- 
tween SM-HM relations using WMAP1 and WMAP5 cos- 
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Figure 7. Comparison between SM-HM relations derived in the preferred model (including the effects of the statistical errors cr(z) and taking the scatter in 
stellar mass at a given halo mass to be £ = 0. 16dex) to those excluding the effects of cr(z) or taking £ = 0, at z = (left panel) and z = 1 (right panel). Light 
shaded regions denote 1-cr errors including both systematic and statistical errors; dark shaded regions denote the l-cr errors if the systematic offsets in stellar 
mass calculations (p and k) are fixed to 0. 



mologies is within the systematic errors at all masses. When 
systematic errors are neglected, the two cosmologies yield 
SM-HM relations that are noticeably different only at low 
halo masses (M < 1O 12 M ). 

Figure [9] show the results of including uncertainti es in the 
WMAP5 cosmological parameters. As described in 33.11 this 
is done using halo mass functions calculated with parameters 
resampled from the cosmological parameter chains provided 
by the WMAP team. Only at z ~ are the changes in error 
bars significant enough to justify mention. Here, the uncer- 
tainty in cosmology begins to exceed other sources of statis- 
tical error for halos below 10 12 M© due to the small errors 
on the GSMF at the stellar masses associated with such ha- 
los dLi & Whitel2009l) . However, the cosmology uncertainties 
are still well within the systematic error bars. 

4.3.5. Sample Variance 

Because of the large volume of the SDSS, sample variance 
contributes insignificantly to the error budget for the SM-HM 
relation below z = 0.2. A bove that redshift, the compa ratively 
limited survey volume of Perez-Gonzal ez et"afl (|2008) results 
in sample variance becoming an important contributor to the 
statistical error for halos below 10 I2 M Q (Poisson noise dom- 
inates for larger halos). If the effects of sample variance were 
ignored, the statistical error spreads for our derived SM-HM 
relations at z = 1 would shrink from 0.12 dex to 0.09 dex for 
10 n M Q halos, and from 0.05 dex to 0.04 dex for 10 12 25 M Q 
halos. As with other types of errors, these considerations are 
well below the limits of the systematic error bars. 

We caution that our error bars including sample variance 
at z > have a very specific meaning. Namely, they include 
the standard deviation in out fitting form wh i ch m ight be ex- 
pected if the survey in iPerez-G onzalez et al. (2008) had been 
conducted on alternate patches of the sky. Sample variance at 
redshifts z > impacts only the linear evolution of the SM- 
HM relations we derive, as the large volume probed by the 
SDSS constrains the SM-HM relation very well at z ~ 0. Be- 
cause our fit is matched to the ensemble of reported data be- 
tween < z < 1, it is less vulnerable to the effects of sample 
variance in individual redshift bins. Instead, it is affected most 
by overall shifts in the number densities reported for the en- 



tire high-redshift survey. While this means that our fit gives 
a more robust SM-HM relation at all redshifts, some caution 
must be used when comparing our relation to results derived 
from the GSMF in a single redshift bin (e.g., 0.2 < z < 0.4). 
These will have much larger uncertainties due to sample vari- 
ance than SM-HM relations derived (like ours) from GSMFs 
along a light cone probing a large redshift range. 

To demonstrate the credibility of our approach for calculat- 
ing the appropriate error bars including sample variance, we 
repeat ed our analysis of the SM-HM relation using GSMFs 
from iDrory et aD (|2009) (whic h appeared as we wer e com- 
pleting this work) instead of IPerez-Gonzalez et al.1 (120081) 
for 2 < z < 1 and retaining the GSMF from iLi & White! 
(12009b for z < .2. Although the COSMOS survey in 
IDrory et ail d2009l) covers a m uch larger area (^9x the area 
in IPerez-Gonzalez et al.l 120081) . the fact that it is a single 
field means that the expected sample va riance is only s lightly 
smalle r than for the combined fields in Perez-Gonza lez et al.1 
( 2008). As might be expecte d, the SM-HM relation at z = 0. 1 
using the IDrory et all (120091) GSMF is identical to the result 
in Figure [6] because of the strong constraining power of the 
SDSS data s ample. At z = 1, th e SM-HM relation generated 
by using the IDrory et ail (120091) GSMF is within our quoted 
statistical and sample variance errors, as shown in Figure fT2l 
This may not be surprising unless one considers that clus- 
tering results suggest an over density at the 2-3 a level in the 
COSMOS redshift bin z = 1 dMeneux et al.ll2009f) . However, 
because our method fits the IDrory et al. (2009) GSMFs across 
the entire redshift range, the excess at z = 1 is partially offset 
by an underdensity at z = 0.5. This demonstrates the robust- 
ness of our fitting method to the effects of sample variance 
except on the scale of the entire survey. 

4.3.6. Satellite Treatment 

Finally, we consider the changes in both the stellar mass 
fraction and total stellar mass fraction (total stellar mass in 
central galaxy and all satellites / total hal o mas s) induced by 
different satellite evolution models (see {33.3.2l for details on 
the two models). As shown in Figure[l0] fixing satellite stellar 
mass at the redshift of accretion (lines labeled as "SMF acc ") 
has virtually no effect on either fraction as compared to allow- 
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Figure 8. Comparison between stellar mass fractions in different cosmolo- 
gies. Light shaded regions denote systematic error spreads while dark shaded 
regions denote error spreads assuming fj, = K = 0, both about the WMAP5 
model. The dot-dashed blue line shows the fiducial relation for a WMAP1 
cosmological model (using our analytic model) The dashed red line shows 
the relation for a simulation of the WMAP1 cosmology. Differences between 
this and the analytic model are within the expected sample variance errors. 
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Figure 9. Effect of cosmological uncertainties on the stellar mass fraction 
at z = 0. 1 . The error bars show the spread in stellar mass fractions includ- 
ing both statistical errors and cosmology uncertainties (from WMAP5 con- 
straints, Komatsu et al. 2009). For comparison, the light shaded region in- 
cludes statistical and systematic errors, while the dark shaded region includes 
only statistical errors. 



ing satellite stellar mass to evolve the same way as centrals 
with the same mass (labeled as SMF now ). Because compar- 
ison of different satellite evolution models requires tracking 
satellites through merger trees, Figure [10] shows results only 
for satellites in the L80G simulation. 

The treatment of satellites may have a somewhat larger im- 
pact on the total stellar mass fraction, including the stellar 
mass of both central and satellite galaxies within a halo. This 
is shown for both models in Figure [10] Because of the steep 
fall-off in stellar mass for low mass galaxies, the total stellar 
mass fraction has only minimal contribution from satellites 
for low mass halos, and deviates significantly from the stellar 
mass fraction for centrals only at halo masses 
At cluster-scale masses (Mi, ~ 1O 14 M ), accreted satellites 
have on average a higher ratio of stars to dark matter than the 
central galaxy, and the total stellar mass fraction can be many 
times the central stellar mass fraction. However, the impact 
of the two models for satellite treatment on this ratio is small. 
Profiles of satellite galaxies in clusters should be able to better 
distinguish between such models. 

4.3.7. Summary of Most Important Uncertainties 

Systematic stellar mass offsets resulting from modeling 
choices result in the single largest source of uncertainties 
(^0.25 dex at all redshifts). The contribution from all other 
sources of error is much smaller, ranging from 0.02-0.12 dex 
at z = and from 0.07-0.16 dex at z = 1. On the other hand, 
this statement is only true when all contributing sources of 
scatter in stellar masses are considered. Models that do not 
account for scatter in stellar mass at fixed halo mass will over- 
predict stellar masses in 10 14 25 M halos by 0.13-0.19 dex, 
depending on the redshift. Models that do not account for 
scatter in calculated stellar mass at fixed true stellar mass will 
overpredict stellar masses in 10 14 25 M Q halos by 0.12 dex at 
z = 1 . Hence, it is important to take both these effects into 
account when considering the SM-HM connection either at 
high masses or at high redshifts. 
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Figure 10. Comparison between stellar mass fractions and total stellar mass 
fractions (labeled as "Total M»/Mh") derived by assuming different matching 
epochs for satellite galaxies. The L80G simulation was used here in order 
to follow the accretion histories of the subhalos. The relations terminate at 
high masses where the halo statistics become unreliable due to finite-volume 
effects. 

4.4. Comparison with other work 

A comparison of our results with several results in the liter- 
ature at z ~ 0. 1 is shown in FigureQl] Such comparison is not 
always straightforward, as other papers have often made dif- 
ferent assumptions for the cosmological model, the definition 
of halo mass, or the measurement of stellar mass. In addi- 
tion, some papers report the average stellar mass at a given 
halo mass (as we do), and others report the average halo mass 
at a given stellar mass. Given the scatter in stellar mass at 
fixed halo mass, the averaging method can affect the resulting 
stellar mass fractions, particularly for group- and cluster-scale 
halo masses. To facilitate comparison with both approaches, 
we plot our main results (labeled as "(M*/M/,|M/,)") along 
with results for which the stellar mass fractions have been av- 
eraged at a given stellar mass (labeled as "(M*/M/,|M*)"). 
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Figure 11. Comparis on of our best-fi t model at z = 0.1 to previously published result s. Results shown include other res ults from ab undance matching 
i Moster et al. 2009 and Guo et al. 2009); abundance matching plus clustering constraints I Wang & Jing 2009); HOD modeling I Zheng et al. 2007); direct mea- 
surements from weak lensing (Mandelbaum et al. 2006), statellite dynamics (Klypin et al. 2009) and strong lensing I Gavazzi et al. 2007); and clu sters selected 
from SDSS spectroscopic data (Yang et al. 2009a), SDSS photometric data (the maxBCG sample Hansen et al. 2009), and X-ray selected clusters (Lin & Mohr 
2004). Dark grey shading indicates statistical and sample variance errors; light grey shading includes systematic err ors. The red line sh ows our results averaged 
over stellar mass instead of halo mass; scatter affects these relations differently at high masses. The results of Mandelbaum et al. (2006) and Kly pin et al.1 <2009l) 
are determined by stacking galaxies in bins of stellar mass, and so are more appropriately compared to this red line. 



In the comparisons below, we have not adjusted the assump- 
tions used to derive stellar masses, because such adjustments 
can be complex and difficult to apply using simple conver- 
sions. Additionally, we have only corrected for differences in 
the underlying cosmology for those papers using a variant of 
abundance matching method (IMoster et al.l 120091; iGuo et ail 
120091: IWang & JingfCQOlSt IConrov et al.ll2009t) using the pro- 
cess described in Appendix [A] as alternate methods require 
corrections which are much more complicated. We have, 
ho wever, adjusted t he IMF of all quoted stellar masses to that 
of ChabrierT (12003b . and we have co nverted all quoted halo 
masses to virial masses as defined in 33.2.21 

The closest comparison with our work, using a very similar 
method, is the result from IMoster e t al. (2009). This result is 
in excellent agreement with ours at the high mass end, and is 
within our systematic errors for all masses considered. How- 
ever, their less flexible choice of functional form, and their 
use of a different stellar mass function (estimate d from spec- 
troscopy using the results of Pant er et ail 120071) results in a 
different value for the halo mass M pea i with peak stellar mass 
fraction and a shallower scaling of stellar mass with halo mass 



at the low mass end. Their error estimates only account for 
statistical variations in galaxy number counts, and they do not 
includ e sample varian ce or variations in modeli ng assump- 
Guo et ail (120091) use a sim ilar approac h to Most er et al.l 



tions. 



(2009), using stellar masses from lLi & White! (12009b . but they 
do not account for scatter in stellar mass at fixed halo mass. 
Consequently, their results match ours for 10 12 M Q and less 
massive halos, but overpredict the stellar mass for larger ha- 

los. 

I Wang & Jingl (12009b use a parameterization for the SM- 
HM relation for both satellites and centrals, and they attempt 
to simultaneously fit both the stellar mass function and clus- 
tering constraints, including the effects of scatter in stellar 
mass at fixed halo mass. At z ~ 0. 1 , their data source matches 
ours dLi & Whit e 2009), but their approach finds a best-fit 
scatter in stellar mass at fixed halo mass of £ = 0.2 dex, es- 
senti ally the highest value allowed by the stellar mass func- 
tion dGuo et al.ll2009l) . As this is higher than our best-fit value 
for £, their SM-HM relation falls below ours for high-mass 
galaxies. Possibly because of the limited flexibility of their fit- 
ting form (they use only a four-parameter double power-law), 
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their SM-HM relation is in excess of ours for halo masses 
n ear 10 12 M W 

IZheng et ail d2007l) used the galaxy clustering for 
luminosity-selected samples in the SDSS to constrain the halo 
occupation distribution. This gives a direct constraint on the 
r-band luminosity of central galaxies as a function of halo 
mass. Stellar masses for this sample were determined us- 
ing Jhe_g^r color and the r-band luminosity as given by 
the lBell etail d2003) relation, and a WMAP1 cosmology was 
assumed. This method allows for scatter in the luminosity 
at fixed halo mass to be constrained as a para meter in the 
model ; results for this scatter are consistent with Mo re et all 
(2009) , although the y are less well constra ined. According 
to lLi& Wh ite (2009), stellar masses for the lBell et all (120031) 
relation are systematicall y larger than those calculated using 
iBlanton & Roweisl (12007) by 0.1-0.3 dex. However, as £l m in 
WMAP1 is larger than in WMAP5, halo masses in WMAP1 
will be higher at a given number density than in WMAP5, 
somewhat compensating for the higher stellar masses. 

We next compare to constraints from direct measure- 
ments of halo mass e s from dynamics or gravitational lensing. 
Mandelbaum et al. (2006) have used weak lensing to mea- 
sure the galaxy-mass correlation function for SDSS galax- 
ies and derive a mean ha lo mass as a function of stellar 
mass. iMandelbaum et all d2006) assume a WMAP1 cos- 
mology and uses spectro scopic stellar masses, calculated per 
Kauffmann et al] ( 12003b . Klypin et al (in preparation) have 
derived the mean halo mass as a function of stell ar mass us- 
ing satellite dynamics of SPSS galaxies (see al so Prada et al. 
120031: Ivan den Bosch et aill2004t IConrov et al.ll2007l) Their 
results are generally within our systematic errors but lower 
than others at the lowest masses and with a somewhat dif- 
ferent shape. This may be due to selection effects, as their 
work uses only isolated galax ies, which may hav e somewhat 
lower average stellar masses. Gava zziet alj (120071) use a set of 
strong lenses from the SLACS survey along with a model for 
simultaneously fitting the stellar and dark matter components 
of the stacked lens profiles. This result, at one mass scale, is a 
bit higher than our error range but within 1 .5 a. The selection 
effects relevant to strong lenses are beyond the scope of this 
paper; however, within the effective radius, the stellar mass 
can easily contribute more to the lensing effect than the dark 
matter. Thus, at any given halo mass, the halos with less mas- 
sive galaxies are much less likely to be strong lenses, resulting 
in a bias towards higher stellar mass fractions in strong lenses 
as compared to halos selected at random. 

At the high mass end, one can directly identify clusters and 
groups corresponding to dark matter hal os, and measure the 
stellar masses of their central galaxies. lYang et aT] d2009al) 
use a group catalog matched to halos to determine halo masses 
(via an iteratively-computed group luminosity-mas s relation). 
Stellar masses in this work are determined using the Be ll et al.1 
(2003) relation between g-r color and M/L; a WMAP3 cos- 
mology was assumed. Their results agree very well with ours 
for low-mass halos, but they begin to differ at higher masses. 
This may be partially due to scatter between their calculated 
halo masses (based on total stellar mass in the groups) and 
the true halo masses, resulting in additional scatter in their 
stellar masses at fixed halo mass. It could also be due to dif- 
ferences in stellar modeling; their results remain at all times 
within our systematic errors. We al so compare to direc t mea- 
surements o f mass ive clusters by lHansen et al.l (12009ft and 
iLin & Mohrl ( 120041) . In order to convert luminosities to stel- 
lar masses, we assume M/L,o.25 = 3.3M Q /L Q ,,o.25 and M/L K 



= 0.83M W /L ( 7),jf ba sed on the population synthesis code of 
IConrov et al.l (12009). These measurements are both somewhat 
higher than our results for massive clusters, the one-sigma er- 
ror estimates overlap. The discrepancies may be due to is- 
sues with cluster selection and with modeling scatter in the 
mass-observable relation; in each case the cluster mass is an 
average mass for the given observable (X-ray luminosity or 
cluster richness), and can result in a bias if central galaxies 
are correlated with this observable. More detailed modeling 
of the scatter and correlations will be required to determine 
whether this is can account for the offsets. 

A comparison of our results to others at z ~ 1 is shown in 
Figure [12] As may be expected, it is much harder to directly 
measure the SM-HM relation at higher redshifts, resulting in 
relatively fewer published results with which we may com- 
pare. We first note that we have compared the impact of 
two independent measure ments of the GSMF from different 
surveys. As discussed in 14.3.51 because we simultaneously 
fit our model with linear evolution to the GSMF at redshifts 
< z < 1, our results are less sensitive to s ample variance. 
In contrast to the conclusion of iDrorv et al.l (120091) which fit 
their results to spec ific redshif t bins, Fig ure[T2"lshows that the 
IDrorv et all d2009b and iPerez-Gonzalez et all ( 120081) results 
are in agreement within statistical error s at z ~ 1 when fit- 
ting the full redshift range. The results in lMoster et al.l (120091) 
are als o very similar to ours at z ~ 1 . However. iMoster et al.l 
(2009) do not give fits for the SM-HM relation which include 
the effects of scatter in stellar mass at fixed halo mass except 
at z ~ and do not in general include scatter in measured stel- 
lar mass with respect to the true stellar mass. Hence, we in- 
clude for comparison an SM-HM relation derived using our 
analysis but excluding both of these effects. The remaining 
deviance most likely stems from sample variance due to the 
much smaller survey volum e on w hich their fit is based. 

At z ~ 1, I Wang & Jrng (2009) make use of clustering 
data and stellar mass functions from the VVDS survey 
dMeneuxet all 120081: iPozzetti et ail 120071) at z ~ 0.8. At the 
same time, the high-mass and low-mass slopes of their power- 
law relation are not re-fit for the higher redshift, leading to 
si milar deviations fro m our results as at z ~ 0.1. The results 
in iZheng et"ail (120071) he slightly below our results at z ~ 1 . 
This is partially due to their use of a WMAP1 cosmology. 
However, it is difficult to say exactly how much, as their stel- 
lar masses at z ~ 1 are a hybrid of ^-derived masses from 
iBundv et al.l (120061) and color-bas ed masses derived in a man- 
ner analogous to Bel l et al.l ((2003). Nonetheless, they remain 
well within our systematic error bars. 

We also includ e a comparison to the z = 1 SM-HM rela- 
tion presented in IConrov & Wechslerl d2009l) . who also use 
an abundance matching tech nique to assign galax i es to ha- 
los. Unique to the work of IConrov & Wechslerl ((2)09) is 
their attempt to jointly fit both the redshift-dependent stel- 
lar mass functions and the redshift-dependent star formation 
rate - stellar mass relations. In their model, halo growth is 
tracked through time using results derived from halo merger 
trees, allowing galaxies to be identified across epochs. The 
evolution of halos in conjunction with standard abundance 
matching provides model pr edictions for star formation r ates. 
The SM-HM relation from IConrov & Wechslerl (12009ft lies 
slightly above our best-fit relation, and it is within statistical 
error bars except at the very highest halo masses. This is due 
to the different assumptions made about the GSMF in the two 
works and also to the absence of corrections for the scatter i n 
stellar mass at fixed halo mass in IConrov & Wechslerl (2009). 
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Finally, the strong le nsing survey of jGavazzi et alj d2007l) 
has been extended by (Lagat tuta et alj 120091) out to z ~ 0.9 
using lenses observed in the CASTLES program, as well as 
in COSMOS and in the EGS. While the sam e caveats about 
selecti on effects apply as for lower redshifts, Lagattu ta et alj 
(2009) find that the evolution in the stellar mass fraction for 
M h ~ 10 13 5 M Q halos is within 0-0.3 dex greater at z ~ 0.9 as 
compared to z ~ 0, consistent with our limits of 0-0.15 dex 
for the allowed evolution over that redshift interval. 

5. RESULTS BEYOND z = 1 
5.1. Methodology and Data Limitations 

As discussed in 92.1.21 published results for the galaxy 
stellar mass function beyond z = 1 suffer from the important 
caveat that integrated SFRs are inconsistent with galaxy stel- 
lar mass functions when both sets of observations are taken 
at face value with a constant IMF. Nonetheless, one may use 
similar methodology as in ^3] to derive stellar mass - halo 
mass relations at higher redshifts under the assumption that 
the observed stellar mass functions ar e corre ct. Here we as- 
sume the GSMFs of Mar chesini et alj (|2009), which cover a 
redshift range of 1 .3 < z < 4. 

As there is no guarantee that the evolution of the SM - HM 
relation at high redshifts will have the same form as its evo- 
lution at low redshifts, we re-examine the assumptions affect- 
ing our evolution parameterization in Equation |23| As with 
all cur rent high-redshift data, the results in Marchesi ni et alj 
(2009) are limited to luminous (massive) galaxies, so little 
information about the value of /3 (the faint-end slope of the 
galaxy-halo mass relation) is available. Hence, we continue 
to assume a linear functional form for its evolution; as the 
value of P evolves linearly with scale factor (a) in our fit, this 
means that it is largely constrained to be consistent with the 
evolution at lower redshifts (1 < z < 2). Naturally, if the evo- 
lution of j3 at high redshifts is significantly different than for 
1 < z < 2, then our error bars for /3 may underestimate the full 
uncertainties in the parameter. 

Additionally, the systematics affecting high-mass galaxies 
at high redshifts are much more severe than for z < 1 . Not 
only are the errors in stellar mass calculations significant (due 
to larger photometry errors, limited templates, etc.), but any 
miscalibration in correcting photometric redshift errors will 
result in low-redshift galaxies masquerading as very bright 
high-redshift galaxies. These combined uncertainties result 
in poor constraints on high-mass galaxies. For that reason, 
we do not attempt to assume a more complicated functional 
form for the evolution of 6 and 7, which means as before that 
their rates of evolution are largely constrained to be consistent 
with lower redshifts. 

However, we find that individual fits at each redshift do 
suggest a possible evolution for the characteristic stellar mass 
which is nonlinear in the scale factor. Hence, we expand the 
form of the evolution of M*.o to include a quadratic depen- 
dence on scale factor: 

M*,o(fl)=M*,o,o+M*, , fl (fl-l)+M >tA „2(fl-0.5) 2 , (25) 

where (a-0.5) 2 is used instead of (a - l) 2 to minimize the 
degeneracy between M».o, fl and M^ x) a i. We parameterize the 
evolution of all other parameters as in Equation|23] All other 
methodo logy remains the same as for lower redshifts, as out- 
lined in £y|l 

5.2. Results 
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Figure 12. Comparison of our best-fit model at z = 1.0 for different model 
assumptions and to previously published results. Dark grey shading indicates 
statistical and sample variance errors; light grey shading include s system- 
atic er rors. The error bars for the red line, calculated using the Dr orv et aT] 
1 2009) GSMF include statistical errors only — i.e., they do not include sample 
variance. The results of Moster et al. (2009) (green line) do not include mod- 
eling of scatter or statistical errors in stellar masses, so for comparison, we 
pr esent our results excl uding the effects of <r(z) and £ (blue line). The results 
of Conroy & Wechsler (2009) made slightly different assumptions about the 
stellar mass function evolution. 

To maintain some overlap with the z < 1 results, we evalu- 
ate the likelihood function for each SM - HM relation against 
GSMFs for 0.8 < z < 4. This dat a rang e includes two redshift 
bins from lPerez-Gonzalez et all (120081) (0 . 8-1.0 1.0-1.3) and 
three redshift bins from iMarchesin iet all (120091) (1.3-2, 2-3 
3-4). I ncluding more redshift bins from lPerez-Gonzalez et al] 
(2008) would improve the continuity of the fits to the low- 
redshift results; however, doing so would require a more 
complicated redshift parameterization than what we have as- 
sumed. The evolution of the best-fit stellar mass fraction for 
< z < 4 is shown in Figure Q~3] All data points for Figure [T3l 
are listed in AppendixiDl 

As may be expected, uncertainties at high redshifts are sub- 
stantially larger than at lower redshifts. The contribution of 
systematic errors in stellar masses to the error budget (0.25 
dex) is still important, but it is no longer the only dominant 
factor. Statistical errors due to the comparatively small num- 
ber of galaxy observations at high redshifts can contribute an 
equal uncertainty (up to 0.25 dex) to the derived SM-HM re- 
lation. The statistical errors are large not only for massive 
galaxies with low number counts, but also for halos below 
10 12 M , where magnitude limits on surveys make observa- 
tions of the corresponding galaxies difficult. 

The contribution from other sources of uncertainties (e.g., 
sample variance, cosmology uncertainties) is substantially 
smaller than the current statistical errors. The effects of sam- 
ple variance on uncertainties at high redshift is less than for 
low redshifts because the volume probed in the high redshift 
sample is five times larger than for the low redshift sample 
(«5x 10 6 Mpc 3 vs. 10 6 Mpc 3 , respectively). While cos- 
mology uncertainties are somewhat larger at high redshifts, 
their contribution to the overall levels of uncertainty are again 
much smaller than the statistical errors, as was true even by 
z = 1 for the low-redshift sample. 

The statistical uncertainties at high redshifts mean that it 
is difficult to draw strong conclusions about the evolution 
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Figure 13. Evolution of the derived stellar mass fractions for central galax- 
ies, from z = 4 to the present. The best-fit relations are shown only over the 
mass range where constraining data are available. At higher redshifts, cer- 
tainty about the shape of the curves drops precipitously owing to a lack of 
constraining data beyond the knee of the stellar mass function. Combined 
systematic and statistical error bars are shown for three redshift bins only. 

of the SM-HM relation. However, the indication is that the 
mass corresponding to the peak efficiency for star formation 
evolves slowly, and is roughly a factor of five larger at z = 4. 
At all redshifts, the integrated star formation peaks at ~ 10- 
20 per cent of the universal baryon fraction; the current data 
indicates that this value may start high at very high redshifts, 
shrink as halos grow faster than they form stars, and then start 
growing again after z = 2. However, with current uncertain- 
ties these results are tenuous. The single most effective way 
to reduce current uncertainties on both the SM-HM relation 
at individual redshifts and on its evolution is to conduct more 
high-redshift galaxy surveys, both to probe fainter galaxies to 
determin the shape of the GSMF and to get better statistics at 
the high mass end. 

6. DISCUSSION AND IMPLICATIONS 

At z ~ 0, the majority of published results are in accord 
within our full systematic error bars, regardless of the tech- 
nique used. All reported results appear to be consistent with 
the principles necessary for abundance matching over a wide 
range of halo masses (10 11 - 10 15 Mq) — that each dark matter 
halo and subhalo above the masses we have considered hosts 
a galaxy with a reasonably tight relationship between their 
masses, and that average stellar mass — halo mass relation 
increases monotonically with halo mass. 

Because of the available statistics of halo and galaxy stel- 
lar mass functions, especially at z = 0, the technique of abun- 
dance matching offers the tightest constraints on the SM-HM 
relation currently available, and it is in agreement with results 
from a broad variety of additional techniques. Under the as- 
sumption that systematic errors in stellar mass calculations do 
not change substantially with redshift, abundance matching 
offers tight constraints on the evolution of the SM-HM rela- 
tion from z = 1 to the present. These in turn will serve as im- 
portant new tests for star formation prescriptions and recipes 
in both hydrodynamical simulations and semi-analytic mod- 
els, as they will apply on the level of individual halos instead 
of on the simulated volume as a whole. 

At the same time, abundance matching offers these con- 



straints with a minimal number of parameters. The Halo 
Occupation Distribution (HOD) technique requires modeling 
P(N\Mh), the probability distribution of the number of galax- 
ies per halo as a function of halo mass, i n several different lu - 
minosity bins. In the model proposed in lZheng et alJ (2007), 
this results in 45 fitted parameters just to models the occupa- 
tion at z = (five parameters for nine luminosity bins). Condi- 
tional Luminosity Function (CLF) modeling requires param- 
eterizing a form for tp(L,Mi,), the number density of galaxies 
as a function of luminosity and host halo mass, which results 
in app roximately a dozen parameters to model occupation at 
z = jCooray 2006). Because of the additional constraints im- 
posed by assuming that each halo hosts a galaxy, our approach 
uses fewer parameters. Abundance matching, as discussed in 
this paper, results in a model with only six independent pa- 
rameters (five to empirically fit the derived SM-HM relation, 
and one to model the scatter in observed stellar masses at fixed 
halo mass) to describe the population of galaxies in halos. 

The abundance matching approach to the SM-HM relation 
requires so few parameters in comparison to other methods 
because of the fairly small scatter (« 0. 16 dex) between stellar 
mass and halo mass at high masses (the scatter has a negligi- 
ble impact on the average SM-HM relation at lower masses), 
and the requirement that satellite galaxies live in satellite ha- 
los (subhalos). It may well be that a more complicated model 
must be adopted for satellites to quantitat ively match th e 
small-scale clustering observations (e.g. IWang et ail 120061) . 
However, such changes will affect the clustering much more 
than the derived SM-HM relation, as suggested by the min- 
imal changes in Figures [7] and [10] for mass scales (< 10 12 5 
Mq) where satellites are a non-negligible fraction of the total 
halo population. 

The largest uncertainties in the SM — HM relation at z < 1 
come from assumptions in converting galaxy luminosities into 
stellar masses, which amount to uncertainties on the order of 
0.25 dex in the normalization of the relation. However, the 
systematic biases introduced by the combined sources of scat- 
ter between calculated stellar masses and halo masses can rise 
to equivalent significance for halos above 10 14 5 Mq. Because 
the GSMF is monotonically decreasing, results which do not 
account for all sources of scatter in stellar mass will over- 
predict the average stellar mass in halos by 0.17-0.25 dex for 
these massive halos. 

Using abundance matching to find confidence intervals for 
the SM-HM relation is an even more involved process, as 
each of the ways in which the systematics might vary must 
also be taken into account. While future work on constrain- 
ing stellar masses will be the most valuable in terms of re- 
ducing uncertainties for the lowest redshift data, wider and 
deeper surveys and some resolution to the discrepancy be- 
tween high-redshift cosmic star formation density and stellar 
mass functions must occur in order to improve constraints on 
the relation at high redshifts. 

As mentioned in the introduction, abundance matching may 
be used equally well to assign galaxy luminosities and colors 
to halos. In this case, the galaxy luminosity — halo mass 
relation may be derived using identical methodology to that 
presented in $3] with the exception that the systematics /x and 
K may be neglected, leaving only a(z) (effectively, the redshift 
scaling of photometry errors) and £ (effectively, the scatter in 
luminosity at fixed halo mass). As these systematics are much 
better constrained than their stellar mass counterparts (simply 
as luminosities may be measured directly), this approach can 
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yield very powerful constraints on the normalization as well 
as the evolution of the luminosity-mass relation. The higher 
accuracy possible compared to the stellar mass-mass rela- 
tion will generally not remove uncertainties in comparing to 
galaxy formation models. Galaxy formation codes which cal- 
cul ate lum inosities must include modeling for all the effects 
in $2.1.11 meaning that constraints on the underlying physics 
are subject to the same uncertainties. However, tighter con- 
straints on the luminosity-mass relation will be nonetheless 
helpful for applications which are concerned with cosmolog- 
ical constraints from large luminosity-selected surveys. 

7. CONCLUSIONS 

We have performed an extensive exploration of the uncer- 
tainties relevant to determining the relationship between dark 
matter halos and galaxy stellar masses from the halo abun- 
dance matching technique. Errors related to the observed stel- 
lar mass function, the theoretical halo mass function, and the 
underlying technique of abundance matching are all consid- 
ered. We focus on the mean stellar mass to halo mass ratio 
for central galaxies as a function of halo mass, and present re- 
sults for this relationship at the present epoch and extending to 
z ~ 4. We account separately for statistical errors and for sys- 
tematic errors resulting from uncertainties in stellar mass es- 
timation, and also investigate the relative contribution of var- 
ious sources of error including cosmological uncertainty and 
the scatter between stellar mass and halo mass. An analytic 
model has been developed which can be used to constrain this 
connection in the absence of high resolution simulations. 

Our primary conclusions are as follows: 

1 . The peak integrated star formation efficiency occurs at 
a halo mass near 10 12 M Q , with a relatively low frac- 
tion — 20% at z = — of baryons currently locked 
up in stars. This peak value declines to z ~ 2 but re- 
mains between 10-20% for all redshifts between z =0- 
4. This implies that 30-40% of baryons were converted 
into stars over the lifetime of a galaxy with current halo 
mass of 10 12 M©. 

2. At low masses, the stellar mass - halo mass relation at 
Z = scales as ~ M 23 . At high masses, around 10 14 
M Q , stellar mass scales as ~ M® 29 . However, the 
high mass scaling may not be a power law, as our model 
indicates that this slope decreases with increasing halo 
mass. 

3. Within statistical uncertainties, the stellar mass content 
of halos has increased by 0.3-0.45 dex for halos with 
mass less than 10 12 M Q since z ~ 1. Systematic biases 
in stellar mass calculations between different redshifts 
could broaden the uncertainties in this number, but the 
conclusion that significant evolution has occurred for 
low-mass halos would remain robust. For halos with 
mass greater than 10 13 M Q , our best-fit results indicate 
more growth in halo masses than stellar masses since 
Z ~ 1, but are consistent with no evolution in the stellar 
mass fractions over this time. 

4. Systematic, uniform offsets in the galaxy stellar mass 
function and its evolution are the dominant uncertainty 
in the stellar mass - halo mass relation at low redshift. 
Statistical errors in the estimation of individual stellar 
masses impact the high mass end of the GSMF, and at 



higher redshifts may result in an observed GSMF that 
deviates from a Schechter function. 

5. Current uncertainties in the underlying cosmological 
model are sub-dominant to the systematic errors, but 
are larger than other sources of statistical error for ha- 
los below M h ~ 10 12 M for low redshifts (z < 0.2). 

6. Given current constraints from other methods, uncer- 
tainty in the value of scatter between stellar mass and 
halo mass is important in the mean relation for masses 
above M/, ~ 10 12 5 M Q , although it is subdominant to 
systematic errors for all masses below Mh ~ 10 14,5 M Q . 

7. Other uncertainties in the galaxy-halo assignment, in- 
cluding different assumptions about the treatment of 
satellite galaxies, are subdominant when considering 
the mean relation for central galaxies. 

8. At higher redshifts (1 < z < 4), systematic uncertain- 
ties remain important, but statistical uncertainties reach 
equal significance. The shape of the relation is fairly 
unconstrained at z > 2, where improved statistics and 
constraints on the GSMF below M* are needed. 

We have presented a best-fit galaxy stellar mass - halo mass 
relation including an estimate of the total statistical and sys- 
tematic errors using available data from z = 0-4, although 
caution should be used at redshifts higher than z ~ 1. We 
also present an algorithm to generalize this relation for an ar- 
bitrary cosmological model or halo mass function. The fact 
that assignment errors are sub-dominant and scatter can be 
well-constrained by other means gives increased confidence 
in using the simple abundance matching approach to constrain 
this relation. These results provide a powerful constraint on 
models of galaxy formation and evolution. 
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APPENDIX 

CONVERTING RESULTS TO OTHER HALO MASS FUNCTIONS 

From Equation[l4j it is possible to simply convert from our halo mass function 0h to any halo mass function of choice (<fih,i)- In 
particular, the function Mh(M„) is defined by the fact that the number density of halos with mass above Mh(M„) must match the 
number density of galaxies with stellar mass above M* (with the appropriate deconvolution steps applied). Recall from Equation 
Qllthat 



<^h(M)dlog 10 M= / </> direct (M»)rflog I0 M». (Al) 

M h (M„) JM, 

Naturally, the correct mass-stellar mass relation for the alternate halo mass function 0hr (which we will label as Mh, r (M»)) must 
satisfy this same equation, with the result that: 

/•oo />oo 

/ 4(Myiog 10 M= / <fr hr (M)dlog l0 M. (A2) 

JMi,(M,) JMhAM.) 

To make the calculation even more explicit, let &h(M) = J M cf>hd\og w M be our cumulative halo mass function, and let <fr^ r (M) 
be the corresponding cumulative halo mass function for </>/, ,.. Then, we find: 

M h>r (M«) = $h'($h(M h (M*))). (A3) 

Mass functions from different cosmologies than those assumed in this paper will also require converting stellar masses if their 
choices of h differ from the WMAP5 best-fit value. 

EFFECTS OF SCATTER ON THE STELLAR MASS FUNCTION 

This section is intended to provide basic intuition for the effects of both £ and <r(z), which may be modeled as convolutions. The 
classic example in this case is convolution of the GSMF with a log-normal distribution of some width lu. While the convolution 
(even of a Schechter function) with a Gaussian has no known analytical solution, we may approximate the result by considering 
the case where the logarithmic slope of the GSMF changes very little over the width of the Gaussian. Then, locally, the stellar 
mass function is proportional to a power function, say 0(M„) oc (M*) a . Then, if we let x = log 10 M„ (so that 0(10') oc 10 QJC ), 
finding the convolution is equivalent to calculating the following integral: 

</Ww(10 l ) oc / exp - db 

= 10 <" 10 K" 2 ln(10)_ (Bl) 

That is to say, the stellar mass function is shifted upward by approximately 1.1 5 (aw) 2 dex. Hence, for parts of the stellar mass 
function with shallow slopes, the shift is completely insignificant, as it is proportional to a 2 . However, it matters much more in 
the steeper part of the stellar mass function, to the point that for galaxies of mass 10 12 M , the observed stellar mass function can 
be several orders of magnitude above the intrinsic GSMF. 

A SAMPLE CALCULATION OF THE FUNCTIONAL FORM OF THE STELLAR MASS FUNCTION 

Galaxy formation models typically assume at least two feedback mechanisms to limit star formation for low-mass galaxies 
and for high-mass galaxies. Thus, one of the simplest fiducial star formation rate (SFR) as a function of halo mass (M/,) would 
assume a double power-law form: 

smMh ^(^X + (^)\ (C1) 



Mo J \M 

We might expect the total stellar mass as a function of halo mass to take a similar form, except perhaps with a wider region of 
transition between galaxies whose histories are predominantly low mass, and those with histories which are predominantly high 
mass, for the reason that some galaxies' accretion histories may have caused them to be affected comparably by both feedback 
mechanisms. 

Hence, assuming that the relation between halo mass and stellar mass follows a double power-law form, we adopt a simple 
functional form to convert from the stellar mass of a galaxy to the halo mass: 



M h (M*)=M l 



M* ( M, 



+ 



M* n / V M 



(C2) 



Here, (3 may be thought of as the faint-end slope, 5 as the massive-end slope (although (3 and S are functionally interchangeable), 
and 7 as the transition width (larger 7 means a slower transition between the massive and faint-end slopes). 
We first calculate «i: 
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\og{M h ) = log(Mi) + 7 log 

d\og(M h ) _ d\og(M h ) dM, 
<ilog(M*) 



~M 



*.o 



*.o 



5/7' 



dM* dlog(M*) 
dlog(Af A ) 



= M» ln(10) 



5/7 



(M*.y h + (M*.) 



5/7 



= /9+(<5-/3) 1 + 



M : 



*.o 



(C3) 

(C4) 
(C5) 

(C6) 
(C7) 



This justifies the earlier intuition that the functional form for M h (M. t ) transitions between slopes of j3 and 5 with a width that 
increases with 7. Note that jj^Q is always of order one, as the stellar mass is always assumed to increase with the halo mass 
and vice versa (namely, (3 > and S > 0). 



Next, we approach dlogMh ■ From analytical results, we expect a Schechter function for the halo mass function, namely: 

dN 



:^ ln(10) ( — 



(C8) 



Substituting in the equation for Mft(M*), we have 



-1 7(l-a) 



d\og(M h ) 



Mh 
M 



60 ln(10) 



exp 




(C9) 



Already evident is the generic result that there will be separate faint-end and massive-end slopes in the stellar mass function, 
and that the falloff is not generically specified by an exponential. We may make one simplification in this model — namely, to 
note that M;,(M* ) o) corresponds to the halo mass at which the slope of M^(M*) begins to transition from (3 to S. We expect this 
transition to correspond to the transition between supernova feedback and AGN feedback in semi-analytic models — namely, for 
a halo mass which is too large to be affected much by supernova feedback, but which is yet too small to host a large AGN. This 
implies that Mj,(M*jo) is expected to be around 10 12 M Q or less, meaning that Mi, /Mo is small until stellar masses well beyond 
M*fl, meaning that we may neglect the faint-end slope of the M/,(M„) relation in the exponential portion of the stellar mass 
function: 



d\og(M h ) \M Q 



x exp 




-1 7(l-a) 



Mi ( M, 



(CIO) 



Hence, we may combine these two equations to obtain the expression for the stellar mass function: 

7(l-a) 

, M, \ ' / M, \ 1 M \ ' 

Mn(10) 



dN 



dlog(M*) 



/M, 


1-a 


\M Q 




Mi / 


M* 


M V 


M*. 


-P) 





M 



*,o 



M 



*.o 



M„ 



(Cll) 
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While this seems complicated, it may be intuitively deconstructed as: 

dN r n 
= [constant] double power law 

x [exponential dropoff] 0{ 1 ) . (C 1 2) 

As mentioned previously, this functional form is equivalent to (^direct- To convert to the true stellar mass function <p tme or the 
observed stellar mass function </> meas , it must be convolved with the scatter in stellar mass at fixed halo mass and (for mea s) 
the scatter in calculated stellar mass at fixed true stellar mass. As such, it should be clear that — while the final form may be 
Schechter-Z/£e — there is certainly much more flexibility in the final shape of the GSMF than a Schechter function alone would 
allow, as evidenced by the five parameters required to fully specify equation [UTTJ 

DATA TABLES 

We reproduce here listings of the data points in Figures|5]|6] and[T3lin Tables[3]IU and|5] respectively. See sections |4~2l and |5^2l 
for details on the data points in each table. 



Table 3 

Stellar Mass Fractions For < z < 1 Including Full Systematics 



z = 0.1 z = 0.5 z=1.0 

log 10 (M,,) log 10 (M«/M/,) log 10 (M./M,,) log 10 (M»/M ; ,) 



11.00 


_ 2 30+0.26 
J -0.23 






11.25 


_i q 6 +0.25 
lyo -0.23 


-2 1 l +0 - 22 




11.50 


-1 67+ 024 

1 -°'-0.24 


-1 84+ ' 22 


-2 01+ 25 

z ' ul -0.24 


11.75 


-1 53+ 023 
-"-0.24 


-1 70 +a24 


i oc+0.26 
i.o J -0.23 


12.00 


1 =4+0.24 
1 - JH -0.24 


i fic+0.26 


i 77+0.26 
1 -"-0.23 


12.25 


-1 62+ 024 
1 - DZ -0.24 


i 79+0.26 
'■ / z -0.23 


i 77+O.25 
1 -"-0.23 


12.50 


_j 7 4+0.24 
1 -'^-0.24 


1 SI +0-25 


i j, +0.24 


12.75 


i O7+0.23 


i 09+0.25 
l - y ^-0.23 


1 - OV -0.26 


13.00 


-2 02+ 023 
z ' uz -0.25 


_ 2 05+ - 24 

Z.U9_ 24 


i qq+0.22 
lyy -0.27 


13.25 


r, IO+0.22 

~ Z - 1S -0.26 


9 iq+0.24 
z - 1 ^-0.25 


-2 11+ 021 
z - 11 -0.28 


13.50 


9 ^ir+0.22 
z -' J -0.26 


_9 3 4+0.23 
Z - J -0.25 


9 9=+0.21 
z ' ZJ -0.29 


13.75 


9 =9+0.22 
z - JZ -0.27 


z -"-0.26 


9 5Q+0.21 
z ""-0.30 


14.00 


_9 70+ - 21 
z - -0.28 




9 cc+0.22 
-0.30 


14.25 


9 OO+0.2I 






14.50 


_n 07 +0.20 


-3.04«f 




14.75 


_n 26 +0.20 
J ' zu -0.30 






15.00 


_n 4 =+0.20 







Note. — Halo masses are in units of Mg . Constraints are quoted over the mass range probed by the observed GSMF. 
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Table 4 

Stellar Mass Fractions without Systematic Errors (fj, = k - 0) 



z = 0.1 z = 0.5 z = 1.0 



log 10 (Mj,) log 10 (M»/M;,) log 10 (M»/Mj,) log 10 (M»/M;,) 



11.00 


-2.30 + 0'03 

— v.DZ 






11.25 


-1.96^-W 

— U.U1 


-2.10+004 

— U.Uo 




11.50 


-1.67+003 

-U.U1 


-U.Uo 


-2.02+010 

-0.06 


11.75 


-1 

— U.U1 


_j 7 j +0.06 
—0.03 


—0.04 


12.00 


_, 54+O.OI 

—0 02 


-1.68+oo« 


I 7C+0.09 
' -0 O i 


12.25 


-1 62+ o:o ° 
1 - -0.02 


i 77+O.O6 
^ ' -0.01 


i 77+0!08 
1 -"-0.03 


12.50 


_, 74 +0.01 
1 -' -0.02 


i e 1+0.05 
_1 - S1 -0.02 


1 81+0.06 
_1 - M -0.04 


12.75 


1 07+O.OI 


1 Q7+0.05 


i oc+0.04 
_1 - SS -0.06 


13.00 


-2 02+ 01 
z - uz -0.03 


-2 05+0' 04 
z - -0.03 


-1 98+° 03 
1 -'°-0.08 


13.25 


7 i O+0.02 
~ z - 1S -0.04 


_7 m+0.04 
z - lv -()()4 


-2 10+ 004 
z - lu -o.io 


13.50 


7 oc+0.02 
z - -0.05 


_7 14 +0.04 
z - JH -0.05 


_7 74+0.04 
Z ' ZH -0.13 


13.75 


7 <;7+0.03 
z - JZ -0.06 


7 51 +0.05 
z - -0.06 


7 -jo+0.05 


14.00 


-2 70+ 03 
z ' /u -0.07 


-2 68+ 005 

z - -0.07 


_ 2 C4+O.O6 
Z ' JH -0.17 


14.25 


7 oc+0.04 


7 oc+0.06 
° -0.08 




14.50 


J - u '-0.09 


J ' u M).10 




14.75 


7 7^+0.05 

-'• ZJ -o.io 






15.00 


o 4< :+0.05 

7.tJ_ n , , 







Note. — Halo masses are in units of Mq. 



Table 5 

Stellar Mass Fractions For 0.8 < z < 4 Including Full Systematics 



z=1.0 z = 2.0 z = 3.0 z = 4.0 



log 10 (M ; ,) log 10 (M»/M,,) log 10 (M»/M ; ,) log l0 (M,/M h ) \og w (M,/M h ) 



11.50 


-2 01+0-25 
z - ul -0.24 








11.75 


i 8=+0.26 

-0.23 








12.00 


i 77+0.26 
'-0.23 


-1 89+ ' 22 


-1 S9+0- 25 




12.25 


_i 77+0.25 
'■' '-0.23 


-™<%% 


-1 67+0- 2 ' 


i co+0.22 
~ LM -0.32 


12.50 


i o 1+0.24 




-1 63+0- 2 ' 

1 -°-'-0.30 


-1 S0+ - 20 
1 - 3u -0.35 


12.75 


-1 89+0- 23 
1 ' oy -0.26 


i O7+0.20 
_1 - 8/ -0.30 


-1 71+0-19 
1 - /1 -0.35 




13.00 


_i nn+0.22 
1 - w -0.27 


-2 00+ ' 7 
z ' uu -0.35 


-1.83+0;^ 


-1-68^ 


13.25 


-2 1 l+°- 2 i 

z ' 11 -0.28 


_7 14+015 
z - -0.39 


_, n 7 +0.16 
1 - 5 "-0.44 


i 07+0. 19 
_1 - Sz -0.49 


13.50 


7 7C+0.21 
z ' ZJ -0.29 


_7 7(1+0.15 


-2 n+o-17 

z ' 1J -0.47 


-1 98+ ' 9 
1 - yo -0.52 


13.75 


7 7Q+0.21 
J7 -0.30 


_7 47+O.I7 
z -^'-0.45 


-2 30+0-' 9 
z - JU -0.49 




14.00 


7 c s +0.22 
z ""-0.30 


_7 fi4 +0.20 
z -°<t-047 







Note. 



Halo masses are in units of Mq. 



